Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A4331 |
Symbol | |
ID | 3749530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 1283098 |
End bp | 1285029 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637762620 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_368571 |
Protein GI | 78065802 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0917809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCA ATCCGAAGTT TCTGTCCGCC GACGCTCATG TCGATGCCGC GGCCGTCGCC CCGCTGCCGA ATTCCCGCAA GGTTTATGTA ACCGGCTCCC AGCCCGACAT CCGCGTGCCG ATGCGTGAAA TCACGCAGGC CGACACGCCG ACCGGCTTCG GCGGCGAAAA GAATCCGCCG ATCTACGTGT ACGACACGTC GGGCCCCTAC ACCGATCCGG AAGCGAAGAT CGACATCCGC GCAGGCCTGG CCGCGCTGCG TCAGGGCTGG ATCGACGCAC GCGGCGACAC CGAAGTGCTC GGCGGCCTGT CGAGCGAGTA CGGCCTCGAG CGCGCGGCCG ACCCGGCCAC CGCCGACCTG CGGTTCCCGG GCCTGCACCG CAACCCGCGC CGCGCCCAGG CCGGCAAGAA CGTCTCGCAG ATGCACTACG CGCGCCAGGG CATCATCACG CCGGAAATGG AATACATCGC GATCCGCGAG AACCAGCGCC GCGCCGAGTA CCTCGAGAGC CTGAAGGCCA GCGGCCCGAA CGGCGCGAAG CTCGCCGCGA TGATGGGCCG CCAGCACCCG GGCCAGGCGT TCGGCGCAGC GGCTTTCGGC GCGAACGCGC CGGCCGAGAT CACGCCGGAA TTCGTGCGCT CGGAAGTGGC GTGCGGCCGC GCGATCATCC CCGCGAACAT CAACCACCCG GAATCCGAGC CGATGATCAT CGGCCGCAAC TTCCTCGTGA AGATCAACGC GAACATCGGC AACTCGGCCG TCACGTCGTC GATCGGCGAG GAAGTCGACA AGATGACGTG GGCGATCCGC TGGGGCGGCG ACACGGTGAT GGACCTGTCG ACCGGCAAGC ACATCCATGA AACGCGCGAG TGGATCATCC GCAACAGCCC GGTGCCGATC GGCACGGTGC CGATCTACCA GGCGCTGGAA AAGGTCAACG GCAAGGCCGA GGATCTCACC TGGGAAATCT TCCGCGACAC GCTGATCGAA CAGGCCGAGC AAGGCGTCGA CTATTTCACG ATCCACGCGG GCGTGCGCCT GCAGTACGTG CCGCTCACCG CGAACCGGAT GACCGGCATC GTGTCGCGCG GCGGCTCGAT CATGGCGAAG TGGTGCCTCG CGCATCACAA GGAAAGCTTC CTGTACGAAC ACTTCGAAGA GATCTGCGAG ATCATGAAGG CGTACGACGT GAGCTTCTCG CTCGGCGACG GCCTGCGCCC CGGATCGATC TACGACGCGA ACGACGAAGC GCAGCTCGGC GAGCTGAAGA CGCTCGGCGA ACTCACGCAG ATCGCGTGGA AGCACGACGT GCAGGTGATG ATCGAAGGCC CCGGCCACGT GCCGATGCAG CTGATCAAGG AGAACATGGA TCTCCAGCTC GACTGGTGCA AGGAAGCGCC GTTCTACACG CTCGGGCCGC TGACCACCGA CATCGCACCG GGCTACGACC ACATCACGTC CGGCATCGGC GCCGCGATGA TCGGCTGGTT CGGCACCGCG ATGCTGTGCT ACGTGACGCC GAAGGAACAC CTCGGCCTGC CAAACAAGGA CGACGTGAAG GAAGGCATCA TCACGTACAA GCTCGCCGCG CACGCCGCCG ACCTCGCGAA GGGTCACCCG GGCGCGCAGG TGCGCGACAA CGCGCTGTCG AAGGCGCGCT TCGAGTTCCG CTGGGAAGAC CAGTTCAACA TCGGTCTCGA TCCGGACAAG GCGCGCGAAT TCCACGACGA AACGCTGCCG AAGGATTCGG CGAAGGTCGC GCACTTCTGC TCGATGTGCG GCCCGCACTT CTGCTCGATG AAGATCACGC AGGACGTGCG CGAGTTCGCG GCGCAGCAGG GCGTGTCGGA AACCGAAGCG CTGAAGAAGG GGATGGAAGT GAAAGCGGTC GAGTTCGTCA AGACCGGCGC CGAAATCTAT CACCGTCAAT AA
|
Protein sequence | MNANPKFLSA DAHVDAAAVA PLPNSRKVYV TGSQPDIRVP MREITQADTP TGFGGEKNPP IYVYDTSGPY TDPEAKIDIR AGLAALRQGW IDARGDTEVL GGLSSEYGLE RAADPATADL RFPGLHRNPR RAQAGKNVSQ MHYARQGIIT PEMEYIAIRE NQRRAEYLES LKASGPNGAK LAAMMGRQHP GQAFGAAAFG ANAPAEITPE FVRSEVACGR AIIPANINHP ESEPMIIGRN FLVKINANIG NSAVTSSIGE EVDKMTWAIR WGGDTVMDLS TGKHIHETRE WIIRNSPVPI GTVPIYQALE KVNGKAEDLT WEIFRDTLIE QAEQGVDYFT IHAGVRLQYV PLTANRMTGI VSRGGSIMAK WCLAHHKESF LYEHFEEICE IMKAYDVSFS LGDGLRPGSI YDANDEAQLG ELKTLGELTQ IAWKHDVQVM IEGPGHVPMQ LIKENMDLQL DWCKEAPFYT LGPLTTDIAP GYDHITSGIG AAMIGWFGTA MLCYVTPKEH LGLPNKDDVK EGIITYKLAA HAADLAKGHP GAQVRDNALS KARFEFRWED QFNIGLDPDK AREFHDETLP KDSAKVAHFC SMCGPHFCSM KITQDVREFA AQQGVSETEA LKKGMEVKAV EFVKTGAEIY HRQ
|
| |