Gene Bcep18194_A4331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4331 
Symbol 
ID3749530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1283098 
End bp1285029 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content65% 
IMG OID637762620 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_368571 
Protein GI78065802 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0917809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCA ATCCGAAGTT TCTGTCCGCC GACGCTCATG TCGATGCCGC GGCCGTCGCC 
CCGCTGCCGA ATTCCCGCAA GGTTTATGTA ACCGGCTCCC AGCCCGACAT CCGCGTGCCG
ATGCGTGAAA TCACGCAGGC CGACACGCCG ACCGGCTTCG GCGGCGAAAA GAATCCGCCG
ATCTACGTGT ACGACACGTC GGGCCCCTAC ACCGATCCGG AAGCGAAGAT CGACATCCGC
GCAGGCCTGG CCGCGCTGCG TCAGGGCTGG ATCGACGCAC GCGGCGACAC CGAAGTGCTC
GGCGGCCTGT CGAGCGAGTA CGGCCTCGAG CGCGCGGCCG ACCCGGCCAC CGCCGACCTG
CGGTTCCCGG GCCTGCACCG CAACCCGCGC CGCGCCCAGG CCGGCAAGAA CGTCTCGCAG
ATGCACTACG CGCGCCAGGG CATCATCACG CCGGAAATGG AATACATCGC GATCCGCGAG
AACCAGCGCC GCGCCGAGTA CCTCGAGAGC CTGAAGGCCA GCGGCCCGAA CGGCGCGAAG
CTCGCCGCGA TGATGGGCCG CCAGCACCCG GGCCAGGCGT TCGGCGCAGC GGCTTTCGGC
GCGAACGCGC CGGCCGAGAT CACGCCGGAA TTCGTGCGCT CGGAAGTGGC GTGCGGCCGC
GCGATCATCC CCGCGAACAT CAACCACCCG GAATCCGAGC CGATGATCAT CGGCCGCAAC
TTCCTCGTGA AGATCAACGC GAACATCGGC AACTCGGCCG TCACGTCGTC GATCGGCGAG
GAAGTCGACA AGATGACGTG GGCGATCCGC TGGGGCGGCG ACACGGTGAT GGACCTGTCG
ACCGGCAAGC ACATCCATGA AACGCGCGAG TGGATCATCC GCAACAGCCC GGTGCCGATC
GGCACGGTGC CGATCTACCA GGCGCTGGAA AAGGTCAACG GCAAGGCCGA GGATCTCACC
TGGGAAATCT TCCGCGACAC GCTGATCGAA CAGGCCGAGC AAGGCGTCGA CTATTTCACG
ATCCACGCGG GCGTGCGCCT GCAGTACGTG CCGCTCACCG CGAACCGGAT GACCGGCATC
GTGTCGCGCG GCGGCTCGAT CATGGCGAAG TGGTGCCTCG CGCATCACAA GGAAAGCTTC
CTGTACGAAC ACTTCGAAGA GATCTGCGAG ATCATGAAGG CGTACGACGT GAGCTTCTCG
CTCGGCGACG GCCTGCGCCC CGGATCGATC TACGACGCGA ACGACGAAGC GCAGCTCGGC
GAGCTGAAGA CGCTCGGCGA ACTCACGCAG ATCGCGTGGA AGCACGACGT GCAGGTGATG
ATCGAAGGCC CCGGCCACGT GCCGATGCAG CTGATCAAGG AGAACATGGA TCTCCAGCTC
GACTGGTGCA AGGAAGCGCC GTTCTACACG CTCGGGCCGC TGACCACCGA CATCGCACCG
GGCTACGACC ACATCACGTC CGGCATCGGC GCCGCGATGA TCGGCTGGTT CGGCACCGCG
ATGCTGTGCT ACGTGACGCC GAAGGAACAC CTCGGCCTGC CAAACAAGGA CGACGTGAAG
GAAGGCATCA TCACGTACAA GCTCGCCGCG CACGCCGCCG ACCTCGCGAA GGGTCACCCG
GGCGCGCAGG TGCGCGACAA CGCGCTGTCG AAGGCGCGCT TCGAGTTCCG CTGGGAAGAC
CAGTTCAACA TCGGTCTCGA TCCGGACAAG GCGCGCGAAT TCCACGACGA AACGCTGCCG
AAGGATTCGG CGAAGGTCGC GCACTTCTGC TCGATGTGCG GCCCGCACTT CTGCTCGATG
AAGATCACGC AGGACGTGCG CGAGTTCGCG GCGCAGCAGG GCGTGTCGGA AACCGAAGCG
CTGAAGAAGG GGATGGAAGT GAAAGCGGTC GAGTTCGTCA AGACCGGCGC CGAAATCTAT
CACCGTCAAT AA
 
Protein sequence
MNANPKFLSA DAHVDAAAVA PLPNSRKVYV TGSQPDIRVP MREITQADTP TGFGGEKNPP 
IYVYDTSGPY TDPEAKIDIR AGLAALRQGW IDARGDTEVL GGLSSEYGLE RAADPATADL
RFPGLHRNPR RAQAGKNVSQ MHYARQGIIT PEMEYIAIRE NQRRAEYLES LKASGPNGAK
LAAMMGRQHP GQAFGAAAFG ANAPAEITPE FVRSEVACGR AIIPANINHP ESEPMIIGRN
FLVKINANIG NSAVTSSIGE EVDKMTWAIR WGGDTVMDLS TGKHIHETRE WIIRNSPVPI
GTVPIYQALE KVNGKAEDLT WEIFRDTLIE QAEQGVDYFT IHAGVRLQYV PLTANRMTGI
VSRGGSIMAK WCLAHHKESF LYEHFEEICE IMKAYDVSFS LGDGLRPGSI YDANDEAQLG
ELKTLGELTQ IAWKHDVQVM IEGPGHVPMQ LIKENMDLQL DWCKEAPFYT LGPLTTDIAP
GYDHITSGIG AAMIGWFGTA MLCYVTPKEH LGLPNKDDVK EGIITYKLAA HAADLAKGHP
GAQVRDNALS KARFEFRWED QFNIGLDPDK AREFHDETLP KDSAKVAHFC SMCGPHFCSM
KITQDVREFA AQQGVSETEA LKKGMEVKAV EFVKTGAEIY HRQ