Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3665 |
Symbol | |
ID | 9247534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4399972 |
End bp | 4401336 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | citrate/H+ symporter, CitMHS family |
Protein accession | YP_003681569 |
Protein GI | 297562595 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.393946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACCG CCATGGGTTT CGCCACGATC GCCGTGGTGC TCCTGCTGCT GTTGTCCAAC CGGGTGGGCG CGGTGGTCGC GCTCGTCGGC GTCCCGGTGG CGGCCGCCTT CGCGCTGGGC TTCGGCCCCG GCGAGGTGGC CTCCTTCGTC GGCGAGGGCA TCGGCGGCGT GGCCTCCACG ACCGCGATGT TCGTGTTCGC CATCCTCTAC TTCGGGGTGA TGCGCGACGC GGGGCTGTTC GCGCCGATCA TCCGCCGCGT GCTCGGGTTC GCCGGGAACA CGCCGGTGAC CGTGGCGGTC GCGACGGTGG TGCTCGCGAT GGTCGCCCAC CTGGACGGCG CGGGCGCCAC CACCTTCCTC ATCACCATCC CGGCGATGCT CCCGCTCTAC GACGCCCTGG GCATGCGCAG GGTGGTCCTG GCCGCGCTGG TCGGGCTGGG CGCGGGGATC ATGAACATGC TGCCGTGGGG CGGGCCCACC GCCCGGGCCG CCACCGTCCT CGGCGTTCCG GCCAACGAAC TGTGGGCCCC GCTGATCCCG GCCCAGCTGG CGGGCATGGC CGCCTGCGTC GCCGTCGCCT GGTACCTCGG CCACCGCGAG CGCGTGCGGC TGGCGGCGGC CGACCCCCTG CCCTCCCCGG TGCCCGTCGG GGGCGGCCGC CCGGGCGGGA CCGCCGGAGA GGGCGGGGCC GACCCGCTCG CCGCGGGCGG GCAGGAGCCC GACGCCGACC TGCTGCGCCC GCGCCTGTAC TGGGTGAACG CCGCGCTGAC CGTCGCCGCC GTCCTCGCCC TGGTCTTCGG GCTGCTCTCC CCCGAACTCG TCTTCATGCT GGCCCTGGTC GTGGCGCTCG TCGTCAACTA CCCGGGGATG AAGGCCCAGA CCGACCGCGT CAACGCCCAC GCCTCCGGGG CGATCCTCAT GGCCAGCACC CTGCTCGCCG CGGGCGTGTT CCTGGGCGTC ATGGACAGCA GCGGCATGAT CGAGGCGATG GGACAGGCCA TGACCGGCGC CATGCCCGGT TTCCTCGGCC CCGGGATGGC CGCGATCGTC GGTGTCCTGG GCGTGCCCAT GAGCCTGCTC TTCGGCCCCG ACGCCTACTA CTTCGCCGTC ATGCCGGTGC TGACCGCCGT GGGCGAGGGC TTCGGCGTCG CCGCCGCCGA CATCGCCCAG GCCTCGATCA TCGGCCAGGA GACCGTCGGC TTCCCGATCA GCCCGCTGAC CGGCTCCTTC TACCTGCTGG TCGGCCTGGC CGGGGTGCCC ATCGGCTCCC ACATCCGCTT CCTGCTGCCG TGGGCCTGGC TGGTCAGCCT GGTCGTCCTC GCGGTCGCCC TCGCCAGCGG GGTCGTCCCG CTCTGGGTCG GCTGA
|
Protein sequence | MLTAMGFATI AVVLLLLLSN RVGAVVALVG VPVAAAFALG FGPGEVASFV GEGIGGVAST TAMFVFAILY FGVMRDAGLF APIIRRVLGF AGNTPVTVAV ATVVLAMVAH LDGAGATTFL ITIPAMLPLY DALGMRRVVL AALVGLGAGI MNMLPWGGPT ARAATVLGVP ANELWAPLIP AQLAGMAACV AVAWYLGHRE RVRLAAADPL PSPVPVGGGR PGGTAGEGGA DPLAAGGQEP DADLLRPRLY WVNAALTVAA VLALVFGLLS PELVFMLALV VALVVNYPGM KAQTDRVNAH ASGAILMAST LLAAGVFLGV MDSSGMIEAM GQAMTGAMPG FLGPGMAAIV GVLGVPMSLL FGPDAYYFAV MPVLTAVGEG FGVAAADIAQ ASIIGQETVG FPISPLTGSF YLLVGLAGVP IGSHIRFLLP WAWLVSLVVL AVALASGVVP LWVG
|
| |