Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3945 |
Symbol | |
ID | 9247816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4716985 |
End bp | 4718427 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_003681848 |
Protein GI | 297562874 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGCAC AGAGCACAGA CGCGCCCAAG AAGCGCAGAG CCCTCCGTTT CCCGTTCGCG GCACAGGTCC TGGCGGGCCT GGTCCTCGGC GTCGTCCTGG GCGCCGTCGC CTTCCAGCTC AGCCCGCCCG ACCCCGAGAC CGGTGACGCC ACCAACTGGC TCGCCGTCAC CCTCGACACC GTCGGCTCGA CCTTCGTCAC CCTGCTGCGG ACCATCGTCC CGCCGCTGAT CGTCCTCGCC GTCATCTCCT CCATCGCCAA CCTGCGCAAC GTCACCAACG CGGCGCGACT GGCATGGAAG ACCCTGCTGT GGTTCGGCGC GACCGCCCTG GTCGCGGTGG GCATCGGCAT CGCCCTGGGC CTCATCGTGC GGCCCGGCGT CAACAGCGGC GTGGACGCGG CCACCGCCGC CGAGCCCTCG GGCACCGGCT CCTGGATGGC GTTCATCGAG AGCATCGTCC CCGCCAACTT CCTCGGCCTG GGCGCGAGCG TCTCCGGCGC CGACGACGGC AGCTTCAGCG CCTCGCTGGA CTTCAACGTC CTCCAGCTGA TCGTCATCGC CATCGTCCTG GGCATCGCCG CGATCAAGGT CGGCAAGGCC GCCGAGCCCT TCATCGCCTT CACCGACTCC GCGCTCCAGG TCGTGCTCAA GGCCCTGTGG TGGATCATCC GCCTGGCCCC GATCGGCACC GTGGGCCTGC TCGGCAACGC CGTGTACAGC TACGGCTGGA CCACCATCGG CGCCCTGGGC AAGTTCGCGC TGACCATCTA CATCGGCCTG GCGCTGGTGC TGTTCGTCGT CTACCCGGTG CTCGCCCGCG TCAACGGCCT GTCCCCGGTG AAGTACTTCA GCGGCGTGTG GCCCGCGGTC CAGCTGGCCT TCGTGTCCCG CTCCTCGATG GGCACCATGC CCGTCACCCA GCGCGTGACC GAGCAGAACC TGGGCGTGCC GCGCTACTAC TCCGCGTTCG CCGTGCCCTT CGGCGCCACC ACCAAGATGG ACGGCTGCGC CGCGATCTAC CCGGCGATCT CGGCGATCTT CATCGCCCAG TTCTTCCCCG GCATCAGCCT CGGCCTCACC GACTACCTGC TGATCGTGTT CGTGTCGGTG ATCGGCTCGG CCGCGACCGC CGGAACCACC GGTGCGACCG TCATGCTGAC CCTGACCCTG TCCACCCTGG GCCTGCCCCT GGAGGGCGTG GGCCTGCTGC TGGCCGTCGA CCCGATCCTG GACATGGGCC GCACCGCGGT CAACGTGGCC GGTCAGGCGC TGGTCCCGGC CATCGTCGCC AAGCGTGAGG GCATCCTGGA CGTGGCCCGC TACCACGCGC CGCGCGACGC CTCCGGCTTC GTGCCGCTGG ACGAGGCGCA CGACGACGTC GAGGGCGAGG TCGGCGAGCC CGGCGTGAAG AAGGTTCCCG CCGGAGCCTC CGGCCAGGGC TGA
|
Protein sequence | MSAQSTDAPK KRRALRFPFA AQVLAGLVLG VVLGAVAFQL SPPDPETGDA TNWLAVTLDT VGSTFVTLLR TIVPPLIVLA VISSIANLRN VTNAARLAWK TLLWFGATAL VAVGIGIALG LIVRPGVNSG VDAATAAEPS GTGSWMAFIE SIVPANFLGL GASVSGADDG SFSASLDFNV LQLIVIAIVL GIAAIKVGKA AEPFIAFTDS ALQVVLKALW WIIRLAPIGT VGLLGNAVYS YGWTTIGALG KFALTIYIGL ALVLFVVYPV LARVNGLSPV KYFSGVWPAV QLAFVSRSSM GTMPVTQRVT EQNLGVPRYY SAFAVPFGAT TKMDGCAAIY PAISAIFIAQ FFPGISLGLT DYLLIVFVSV IGSAATAGTT GATVMLTLTL STLGLPLEGV GLLLAVDPIL DMGRTAVNVA GQALVPAIVA KREGILDVAR YHAPRDASGF VPLDEAHDDV EGEVGEPGVK KVPAGASGQG
|
| |