Gene Ndas_5074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5074 
Symbol 
ID9248963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp218337 
End bp219680 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content70% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003682961 
Protein GI297563988 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTACA GAGAAACGGA GACGGCCGTG AAGACACCCC CCACAGCACT CCTGTCCGCC 
TCGACGGCGC TGGTACTGGC CGCGGCACTG ACCGGCTGCG GTTCCGGCGA GGATTCCGGT
GACCAGACGC TGACCTACTG GGCCAGCAAC CAGGGAGCGA GCGTGGAGGA GGACCGGGAG
GTCCTCCAGC CCGTGCTGGA CCGCTTCACC GAGGAGACCG GGGTGGAGGT CGAGCTGGAG
GTCATCCCCT GGAGCGAGCT GTACAACCGC ATCCTCACCG CGGTGAGCAG CGGCGACGGC
CCCGACGTGC TCAACATCGG CAACACCTGG GCGGCCAGCC TCCAGGAGAC CGGCGCCTTC
GTGCCCTACG AGGGCGCGGA CCTGGAGGCG GTGGGCGGCG AGGGGCGCTT CGTCGGCACC
AGCTTCGCCA CCGGCGGCGC CGAGGGCCAG ACGCCGACGT CGGTGCCGCT CTACGGGCTG
TCCTACGCGC TGTTCTACAA CCCCACCATG TTCGAGGAGG CGGGCATCGA GGAACCGCCC
GCGACCTGGG ACGAGTTCGT CGACACCGCG GACGAGCTGA CCAGGGACAC CGACGACGAC
GGCGACGTCG ACCAGTACGG GTTCGTGCTG GAGGGCGGCA ACGAGCGGCA GAACTCCCAC
ATGGCCTTCA TCCTCGGCCA GCAGCAGGGC GGACGGCTGT GGGGCGAGGA CGGGCCCTCC
TTCTCCTCCG ACGAGCAGGT CGCCGCGGTC AAGCAGTGGG TGGACCTGAT GGCCGTGGAG
GAGGTCGTCG ACCCCAGCAG CGCCGAGTTC AGCGACGGAA CCCAGGGCAT CAGCGACTTC
GTCGACGGGC GCGCGGCCAT GATCATCGTG CAGGGCAGCG CCCGCACCAG CATGGCCGCC
CGCGGTTTCG AGGACTACGA GGTCGCCCAG GTGCCGATGC TCGACCCGCT GCCGGGCGAG
CCCATCCAGA GCCACGCGGC CGGGATCAAC ATCAGCGTCT TCAACGACAC CGACGACAAG
GAGGGCGCTC TGCGGCTGGT CGAGCACCTG ACCAGCCCGG AGGAGCAGGT GTACCTGTCC
CAGGAGTTCC AGACGCTGCC GGTGGCCACC GAGGCCTACG ACAGCGAGGA GCTGCGGAGC
GAGTCCATGG AGACCTTCCG CACGATCCTG ACCGAGCACT CCGCTCCGAT GCCGCTGATC
CCCGAGGAGG GCCAGATGGA GACGGTGCTC GGCGAGGCGA TCGGCGGGCT CTTCGCCCGG
GTGGCGACCG GAGACGAGGT CACCGAGGCC GACGTCCGCC AGGCCATGGA GGCGGCCGAG
ACCCAGATGG ACGCCGCGAA CTAG
 
Protein sequence
MAYRETETAV KTPPTALLSA STALVLAAAL TGCGSGEDSG DQTLTYWASN QGASVEEDRE 
VLQPVLDRFT EETGVEVELE VIPWSELYNR ILTAVSSGDG PDVLNIGNTW AASLQETGAF
VPYEGADLEA VGGEGRFVGT SFATGGAEGQ TPTSVPLYGL SYALFYNPTM FEEAGIEEPP
ATWDEFVDTA DELTRDTDDD GDVDQYGFVL EGGNERQNSH MAFILGQQQG GRLWGEDGPS
FSSDEQVAAV KQWVDLMAVE EVVDPSSAEF SDGTQGISDF VDGRAAMIIV QGSARTSMAA
RGFEDYEVAQ VPMLDPLPGE PIQSHAAGIN ISVFNDTDDK EGALRLVEHL TSPEEQVYLS
QEFQTLPVAT EAYDSEELRS ESMETFRTIL TEHSAPMPLI PEEGQMETVL GEAIGGLFAR
VATGDEVTEA DVRQAMEAAE TQMDAAN