Gene Bpro_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_0037 
Symbol 
ID4011756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp39663 
End bp41579 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content63% 
IMG OID637939723 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_546902 
Protein GI91785950 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAA CCCGCCTCAC CGTTGACACC AACGACCTCG CAAGCCGTAT CACGCGCGCG 
CCTTTTCCCG GTTCCGCCAA AATTTATATC GAGGGTTCGC GCCCCGACAT CCGCGTGCCG
TTTCGCGAGG TCACGCTGAC CGACACGATG GTGCACGAGG GGGCGGGCGA GCCGCGCCGC
GAAGCCAATC CGCCGCTGCG CCTGTATGAC GCCTCGGGCG TCTACACCGA CCCGGCGTCG
CCGATCGACA TCACGCGCGG CTTGCCGCCT TTGCGCGGCG CCTGGATCAA TGAGCGCGCC
GACACCGAAG CGCTGCCCGG CATCAGCAGC GCCTATGGCC GCGAGCGCCT GAACGACCCG
GCGCTGGCCG CCCTGCGCAT GGCGCATGCG CCCGTGCCGC GCCGCGCCAA GGCCGGTGCC
AACGTCTCGC AAATGCATTA CGCCCGCAAA GGCATCATCA CGCCGGAGAT GGAATACATC
GCGGTGCGCG AAAACCTGGT GCGCGCGCAG CTGGCCGAGC GCCTGGCGAC CGAGCGCATG
CCGAAGAAAG GGCACTCCTT CAACGCCTCG ATTCCGGAAC AGATCACCGC TGAATTTGTG
CGTGATGAAG TGGCGCGCGG CCGCGCCGTG ATTCCGAACA ACATCAACCA CCCCGAGAGC
GAGCCGATGA TCATCGGCCG CAATTTCCTG ATCAAGGTCA ATGCCAACAT CGGCAACTCG
GCCGTGACGT CTTCCATCGA GGAAGAGGTG GACAAGCTGG TCTGGTCGAT CCGCTGGGGC
GCCGACACCG TGATGGACCT GTCGACCGGC GAGAACATCC ACGAGACGCG CGAATGGATT
CTGCGCAACT CGCCGGTGCC GATCGGCACG GTGCCGATTT ACCAGGCGCT GGAAAAAGTC
AACGGCAAGG CCGAAGACCT GACCTGGGAA ATCTTCCGCG ACACGCTGAT CGAGCAGGCC
GAGCAGGGCG TGGACTACTT CACCATCCAC GCCGGTGTGC GCCTGGCCTA TGTGCCGCTG
ACCGCCAACC GCCTGACGGG CATCGTCTCG CGCGGCGGCT CCATCATGGC CAAGTGGTGT
TTGTCGCACC ACAAGGAAAG CTTTTTGTAC GAGCGCTTTG ACGAGATCTG CGAGATCATG
AAGGCCTACG ACGTGTGCTT CTCGCTGGGC GACGGTCTGC GGCCCGGCTC GATTGCCGAT
GCCAATGACG AAGCGCAGTT TGCCGAGCTG CACACGCTGG GCGAACTCAC GCAGATCGCC
TGGAAACATG ATGTGCAGGT GATGATCGAA GGCCCTGGCC ATGTGCCGCT GCAGCTGGTG
AAGGAAAACG TCGACAAGCA GCTCGAGGCT TGTTTTGAGG CGCCGTTCTA CACGCTGGGC
CCGCTGATCA CCGACATCTC GCCGGGCTAC GACCATATTT CTTCCGCGAT GGGCGCGGCC
AACATCGGCT GGTATGGCAC AGCCATGCTC TGCTACGTGA CGCCGAAGGA GCACCTCGGG
CTGCCAAACC GCGACGACGT GAAGCAGGGC CTGATCGCCT ACAAGATTGC GGCCCACGCG
GGCGACCTGG CCAAGGGCTA CCCGGGCGCG CAGATGTGGG ACAACGCGGT GTCCAAGGCG
CGCTTCGAGT TCCGCTGGGA AGACCAGTTC CGCCTGGCGA TCGACCCGGA CACGGCCATG
GCCTACCACG ACGAAACCCT GCCGAAGGAA AACGCCAAGG TGGCGCATTT CTGTTCGATG
TGCGGACCGA AGTTCTGTTC GATGAAGATC TCGCAGGAAG TGCGCGAGTT TGCAAGGCTC
AATCCGGCCA GCACCACGCT GGCCGCGCCG GGCGTGATTG CGATCAAGCA GATCGACAGC
GGATTCGAGG AAAAGGCGAA AGAGTTTCGT GAAGGTGGCA GTGAGATTTA TTCTTGA
 
Protein sequence
MAKTRLTVDT NDLASRITRA PFPGSAKIYI EGSRPDIRVP FREVTLTDTM VHEGAGEPRR 
EANPPLRLYD ASGVYTDPAS PIDITRGLPP LRGAWINERA DTEALPGISS AYGRERLNDP
ALAALRMAHA PVPRRAKAGA NVSQMHYARK GIITPEMEYI AVRENLVRAQ LAERLATERM
PKKGHSFNAS IPEQITAEFV RDEVARGRAV IPNNINHPES EPMIIGRNFL IKVNANIGNS
AVTSSIEEEV DKLVWSIRWG ADTVMDLSTG ENIHETREWI LRNSPVPIGT VPIYQALEKV
NGKAEDLTWE IFRDTLIEQA EQGVDYFTIH AGVRLAYVPL TANRLTGIVS RGGSIMAKWC
LSHHKESFLY ERFDEICEIM KAYDVCFSLG DGLRPGSIAD ANDEAQFAEL HTLGELTQIA
WKHDVQVMIE GPGHVPLQLV KENVDKQLEA CFEAPFYTLG PLITDISPGY DHISSAMGAA
NIGWYGTAML CYVTPKEHLG LPNRDDVKQG LIAYKIAAHA GDLAKGYPGA QMWDNAVSKA
RFEFRWEDQF RLAIDPDTAM AYHDETLPKE NAKVAHFCSM CGPKFCSMKI SQEVREFARL
NPASTTLAAP GVIAIKQIDS GFEEKAKEFR EGGSEIYS