Gene Daro_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3921 
Symbol 
ID3567652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4215811 
End bp4217727 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content61% 
IMG OID637682395 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_287119 
Protein GI71909532 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCA CCGAACAATT CCTCGCCGCC AACGCCCACG TCGACGAGGC TGCAGTCCAG 
CCGCTGCCCA ATTCCCGCAA AATCTACGTT GAAGGTTCGC GCCCGGATAT TCGGGTGCCG
ATGCGCGAAG TGTCGCAGGA CGACACGCCG ACCGCCTTCG GTGGAGAAAA GAACCCGCCG
ATCTACGTCT ATGACTGCTC CGGCCCCTAT TCCGACCCGG CCGCCAAGAT CGACATCCGT
TCCGGCCTGC CGGCGCTGCG TGCCCAGTGG ATTGCCGAGC GCGGCGATGT CGAGGCGCTG
GCCGATTTGA GTTCCGAGTT CGGCCGCCAG CGTGCCGCCG ACCCAAAACT CGACGAACTG
CGCTTCCCCG GCCTGCACCG CAAGCCGCTG CGCGCCAAGG CCGGTCAGAA CGTTTCGCAG
ATGCACTACG CCCGCCGGGG CATCATCACG CCGGAGATGG AATACGTCGC CATCCGCGAG
AACAACAACC GCCGCGCTTA CATTGAAAGC CTGAAGGCCA CCGGCCCGAT GGGTAACCGG
ATGGCCGACA TTCTCGGCCG CCAGCACAAG GGCCAGGATT TCGGCGCCAG CATTCCGGAA
GAAATCACCC CGGAATTCGT CCGCAGCGAA ATCGCCCGCG GCCGCGCCAT CATCCCGAAC
AACATCAACC ACCCGGAAAG CGAGCCGATG ATCATCGGCC GCAATTTCCT GGTCAAGATC
AATGCCAACA TCGGCAACTC GGCGCTCGGC TCCTCGATTC AGGAAGAAGT CGAAAAGATG
ACCTGGTCGA TCCGCTGGGG CGGCGACACG GTGATGGACC TGTCCACCGG CAAGAACATT
CACGAAACGC GTGAATGGAT CATCCGTAAC AGCCCGGTGC CAATCGGCAC GGTGCCGATC
TATCAGGCGC TGGAAAAGGT TAACGGCAAG GCCGAAGACC TGACCTGGGA AATCTTCCGC
GACACACTGA TCGAACAGGC CGAACAGGGC GTCGACTACT TCACCATCCA CGCCGGCGTC
TTGCTCCGCT ATGTGCCGAT GACCGCCAAC CGCCTGACCG GCATCGTTTC CCGCGGTGGC
TCGATCATGG CCAAGTGGTG TCTGGCCCAT CACAAGGAAA GCTTCCTCTA CACGCATTTC
GAGGAAATCT GCGAAATCAT GAAGGCCTAC GACGTCGCCT TCAGCCTCGG CGACGGCCTG
CGTCCCGGTT CTATCTACGA TGCCAACGAC GAAGCGCAAC TCGGCGAATT GGAGACGCTG
GGCGAACTGA CCAAAATTGC CTGGAAGCAC GACGTTCAGG TCATCATCGA AGGCCCGGGT
CATGTGCCGA TGCACATGAT CAAGGAAAAC ATGGACCTCC AGCTCAAGCA TTGCGACGAA
GCTCCGTTCT ACACCCTCGG TCCCTTGACC ACCGATATTG CACCGGGCTA CGACCACATC
ACCAGCGGCA TCGGTGCCGC GATGATCGGC TGGTACGGCA CTGCCATGCT CTGTTACGTC
ACGCCGAAAG AGCACCTTGG CCTGCCCGAC AAGGATGACG TCAAGGAAGG CATCATCACC
TACAAGCTCG CCGCCCACGC CGCCGACCTC GCCAAGGGCC ATCCCGGCGC GCAGATCCGC
GACAACGCGC TTTCCAAGGC ACGCTTCGAA TTCCGCTGGG ATGACCAGTT CAACCTCGGC
CTCGACCCGG ACAAGGCGCG CGAATTCCAC GACGAAACCC TGCCCAAGGA ATCAGCCAAG
GTCGCTCACT TCTGCTCCAT GTGCGGCCCG CACTTCTGTT CGATGAAGAT CACCCAGGAA
GTCCGGGAGT TCGCTGCACA GCAAGGGCTG GATGAAGCGG CTGCCCTGGA GAAGGGGATG
GAAGTGAAAT CGGTTGAGTT TGTGAAGGCC GGCGCTGAGG TTTATAGCAA GATCTAA
 
Protein sequence
MNATEQFLAA NAHVDEAAVQ PLPNSRKIYV EGSRPDIRVP MREVSQDDTP TAFGGEKNPP 
IYVYDCSGPY SDPAAKIDIR SGLPALRAQW IAERGDVEAL ADLSSEFGRQ RAADPKLDEL
RFPGLHRKPL RAKAGQNVSQ MHYARRGIIT PEMEYVAIRE NNNRRAYIES LKATGPMGNR
MADILGRQHK GQDFGASIPE EITPEFVRSE IARGRAIIPN NINHPESEPM IIGRNFLVKI
NANIGNSALG SSIQEEVEKM TWSIRWGGDT VMDLSTGKNI HETREWIIRN SPVPIGTVPI
YQALEKVNGK AEDLTWEIFR DTLIEQAEQG VDYFTIHAGV LLRYVPMTAN RLTGIVSRGG
SIMAKWCLAH HKESFLYTHF EEICEIMKAY DVAFSLGDGL RPGSIYDAND EAQLGELETL
GELTKIAWKH DVQVIIEGPG HVPMHMIKEN MDLQLKHCDE APFYTLGPLT TDIAPGYDHI
TSGIGAAMIG WYGTAMLCYV TPKEHLGLPD KDDVKEGIIT YKLAAHAADL AKGHPGAQIR
DNALSKARFE FRWDDQFNLG LDPDKAREFH DETLPKESAK VAHFCSMCGP HFCSMKITQE
VREFAAQQGL DEAAALEKGM EVKSVEFVKA GAEVYSKI