Gene Information |
Locus tag | Namu_1913 |
Symbol | |
ID | 8447520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2107585 |
End bp | 2108634 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645041043 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_003201291 |
Protein GI | 258652135 |
COG category | [C] Energy production and conversion |
COG ID | [COG0371] Glycerol dehydrogenase and related enzymes |
TIGRFAM ID | |
|
|
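The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an inference from the fact that the numbers then match the reported Gene Length, not something the record states) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2107585, 2108634)
print(length)                     # 1050
print(protein_length_aa(length))  # 349
```

Both results agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields of this record.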
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00367507 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000286832 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACATCC GGATCCCTTC GCTGCTGCGG ATCAAACCCG ACACCCTGTT CAAGCTGGGC AAGTACCTGC GCAAGCACGG CTTCGACCGC ATCGCGCTGT GCTACGGCGA GGGCATCGAG GAGCTGGTCG GCCGCAGCAT CCGCATCTCC CGGGATTCCT CCGAGATCAC CGTGGTCCGG CAGGAGGTCG TGCACAGCAA CGACGTCACC GACATCATGG GGGTGGCGTT CCACCTGCCC CGCGGTACCC AGGCCGTGGT GGCCGTCGGC GGCGGCGTCG CGGTCGATGC GGGCAAGTAC ATCGGCTTCC TCAACCAGCT CCCGGTGGTC GCGGTGCCCA CCGCCATCTC CAATGACGGG TTCGCCTCCC CCGGCGCCAG CCTGCGGGTG GAAGGGCAAC GGATCAGCGC CAAGGCGGCC ATCCCGTTCG GCGTCGTCAT CGACACGACG GTGATCGCCG CCTGCCCGCC CCGGTTCACC CTGTCCGGCA TCGGCGACCT GATCTCCAAG TACAGCGCCA TCGCCGACTG GAAGCTCTCG TACCACGCGA CCGGCGAGGC CATCAACGAC TTCTCCGCGA TGATCGCGCT GCAGAGCGTG GAGAACCTGG TCAACCACCC GGACAAGTCG ATCGAGGACC TGGGGTTCCT GCAGCTGGTC TGCGGGGCGC TGGTGATGAG CGGGGTGTCG ATGGAGGTCG CCGGCTCGTC CCGGCCCGCC TCGGGCAGCG AGCACCTGAT CTCGCACGCC TACGACCGGC TGGCCGCCCG GCCGCGGATG CACGGCGAGC AGGTTGGGGT GGCCACGATC GCCACCACCT GGCTGCAGGA CAACCCGCAG CGCGACACCG TGCTGCGCGT GCTGGAGCAG ACCACCTTCC TCAGTGCCAT CCGGGCCGAT CCGCTGGACC GGGCCACCTT CCTGGCCGCC ATCGCCGCCG CCCCGGCGGT CAAACCCGGC TACCACACGG TGCTGTCCGA GCCGGGCGCG GTCGAGCGGC TGCAGGCCCA TATCGCCGCG GACCCGCTCT GGCAGGATCT GCTGGCCTGA
|
Protein sequence | MDIRIPSLLR IKPDTLFKLG KYLRKHGFDR IALCYGEGIE ELVGRSIRIS RDSSEITVVR QEVVHSNDVT DIMGVAFHLP RGTQAVVAVG GGVAVDAGKY IGFLNQLPVV AVPTAISNDG FASPGASLRV EGQRISAKAA IPFGVVIDTT VIAACPPRFT LSGIGDLISK YSAIADWKLS YHATGEAIND FSAMIALQSV ENLVNHPDKS IEDLGFLQLV CGALVMSGVS MEVAGSSRPA SGSEHLISHA YDRLAARPRM HGEQVGVATI ATTWLQDNPQ RDTVLRVLEQ TTFLSAIRAD PLDRATFLAA IAAAPAVKPG YHTVLSEPGA VERLQAHIAA DPLWQDLLA
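
The gene sequence as displayed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch, not the database's own method, of how the protein sequence and GC content could be re-derived from it. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two differ in permitted start codons); the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (bases T, C, A, G) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGACATCC GGATCCCTTC GCTGCTGCGG"
print(translate(first_blocks))  # MDIRIPSLLR
```

The ten residues printed match the start of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete Protein sequence and the GC content field to be compared in the same way.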