Gene Ndas_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3122 
Symbol 
ID9246978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3737500 
End bp3738789 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content71% 
IMG OID 
Productthiamine biosynthesis/tRNA modification protein ThiI 
Protein accessionYP_003681037 
Protein GI297562063 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGT CGCTTGAGTC CGCGCCGCTC CCCGCAGCCC TGGCCGGGTC CGACGCCGGG 
CTCGGCGAGC TCTGCGTGCT CATGAAGCTC GGCGAGATCG TCCTCAAGGG CTCCAACCGC
AAGCTGTTCG AGCGGCGGCT GCACAACAAC ATCCGCGCCT CCGTGCGCGA CCTGGGCGAG
GTCCGCCTCT CCCAGCGCGG CGCGGGCGTC ATCATCGTGC GCAAGCCCGA CGCCTCCGAC
CTGGAGGTCG CCGAGATCGC CGACCGCATG GCCAACGTCA TGGGTGTGGT CTGGGTGCAC
CTGGTCCGCC GCGTGCCCAA GGACCTGGAC GCCGTCACCG ACATCGGCGT GCGCGTCATG
CAGGGCCGCG AGGGCACCTT CGCCGTGCGC GCCCGGCGCC GCGACAAGCG CTTCGAGATG
ACCTCGTCGG AGCTGGCCGG GTACCTGGGC TCGAAGATCA TCGAGGCGCA CGGCTACAAG
GTCAACCTCA AGCGCCCCGA CAACACCCTG TTCGTCGAGG TGGACAAGGA CGAGGCGTTC
GTGTTCACCG ACGGCGTGCC CGGCCAGGGC GGCCTGCCCG CCGGGATGAG CGGCCGCGGC
CTGGTGCTGA TGTCGGGCGG GATCGACTCA CCCGTCGCCG CGCACCGGAT GATCCGGCGC
GGGCTCAAGG TCGACTTCCT GCACTTCTCC GGCATGCCGT TCACCGGCCC GGAGTCGATC
TACAAGGCCT ACAGCCTCGT CCGCCAGCTC GACCGCTACC AGGTGGGCTC GCGGCTGTTC
GTCATCCCCT TCGGCAAGGC CCAGCAGCAG CTGAAGAGCT CGGGGATCGA GCGGCTCCAG
ATCGTCGCCC AGCGCCGCCT CATGCTCAAG ACCGCGGAGG CCCTGGCCGA CGACCTGGGC
GCGGAGTGCC TGGTCACGGG GGACGCGCTG GGCCAGGTGT CCAGCCAGAC CATGACCAAC
CTGACCGCCC TGGACGACGC GGTGGACCTG CCGATCCTGC GTCCGCTCAT CGGCATGGAC
AAGACCGAGA TCATGGACCA CGCCCGCCGG ATCGGGACCC TGTCCATCTC GGAGCTGCCC
GACGAGGACT GCTGCACCAT GCTGACCCCG CGCCAGGTGG AGACCGCGGC CAAGATCCCG
GACCTGCGCC AGATCGAGAA GCGCCTGGAC GCCGAGGAGC TGGCCGAGCA CCTGGTCACC
ACCGCGCAGG TGCACAAGCC CAGCTTCCTG GGCGACGCCG CGCCCAAGCG CGTGGCGCCC
GCCTCCGCGT CGGTGGCCGC CACCGCCTAG
 
Protein sequence
MSASLESAPL PAALAGSDAG LGELCVLMKL GEIVLKGSNR KLFERRLHNN IRASVRDLGE 
VRLSQRGAGV IIVRKPDASD LEVAEIADRM ANVMGVVWVH LVRRVPKDLD AVTDIGVRVM
QGREGTFAVR ARRRDKRFEM TSSELAGYLG SKIIEAHGYK VNLKRPDNTL FVEVDKDEAF
VFTDGVPGQG GLPAGMSGRG LVLMSGGIDS PVAAHRMIRR GLKVDFLHFS GMPFTGPESI
YKAYSLVRQL DRYQVGSRLF VIPFGKAQQQ LKSSGIERLQ IVAQRRLMLK TAEALADDLG
AECLVTGDAL GQVSSQTMTN LTALDDAVDL PILRPLIGMD KTEIMDHARR IGTLSISELP
DEDCCTMLTP RQVETAAKIP DLRQIEKRLD AEELAEHLVT TAQVHKPSFL GDAAPKRVAP
ASASVAATA