Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3175 |
Symbol | |
ID | 5900630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3439154 |
End bp | 3440989 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641563679 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001684800 |
Protein GI | 167647137 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTTC AACTGCCCAT CAAAGACGCC ATCGGCGCGA TCCCGACCGG GGAGCGCCCC GGTTCGCGCA AGGTCTATCA GGCCGGCTCG CTGTTCCCCG ACATCCGGGT GCCGTTCCGT GAGGTCGCTG TCCATCCCAG CGCCAACGAA CCGCCGGTCA CCATCTATGA CCCGTCCGGC CCCTATACCG ACCCGCACGC CAAGATCGAC ATCGAGCAGG GCCTGGAGCG GTCGCGCGAG CCGTGGATCA TCGCCCGCGG CGACTGCGAG TTGGTGGCCA CCCCCCGGGA GGTGAAGCCC GAGGACAACG GCTTCGCCCA GGGCAAGCAC CTGGCGCCCC AGTTCACCGC CAAGCGCCCG ATCTTCAAGG GCGCGCAAGG CAAGCTGGTC ACCCAGCTGG AATACGCCCG CGCCGGCATC GTCACCGCCG AGATGGAATA TGTGGCCATC CGCGAGAACC TGCGCCGCGA GCAGGACCGC CCGTGCGTGC GCGACGGCGA GGACTTCGGC GCCTCGATCC CCGACTTCGT GACCCCCGAA TTCGTCCGCC AGGAAGTGGC GCGCGGCCGG GCCATCATTC CGGCCAACAT CAACCACGGC GAGCTGGAGC CGATGGCGAT CGGCCGCAAT TTCCTGGTCA AGATCAACGC CAACATCGGC AACAGCGCCG TGCTTTCCAC CGTGGCCGAC GAGGTCGACA AGCTGGTGTG GGCCACGCGC TGGGGCGCCG ACACGGTCAT GGACCTGTCG ACCGGCCGCA ACATCCACAA CATCCGCGAC TGGATCATCC GCAACAGCCC GGTGCCGATC GGCACGGTGC CGATCTACCA GGCGCTGGAG AAGGTCAACG GCGTGGCCGA GGACCTGAAC TGGGAAGTCT TCCGCGACAC CCTGATCGAG CAGGCCGAGC AGGGGGTGGA CTATTTCACC ATCCACGCCG GCGTCCGTCT TCCGTTCATC CCGCTGACCG CCAAGCGGGT GACGGGCATC GTCTCGCGCG GCGGCTCGAT CATGGCCAAG TGGTGCCTGG CACACCACAA GGAGAACTTC CTCTACGAGC GCTTCGAGGA CATCTGCGAG ATCATGCGCA GCTACGACGT GTCGTTCTCG CTGGGCGACG GCCTGCGTCC GGGCTCGACG GCCGACGCCA ATGACGAGGC CCAATTCGCC GAGCTGCGCA CCCTGGGCGA GCTGACCAAG GTGGCCTGGA AGCACGGCGT GCAGGTGATG ATCGAAGGGC CGGGCCACGT CGCCATGCAC AAGATCAAGG CCAACATGGA CGAGCAGCTC AAGCACTGCC ACGAGGCCCC CTTCTACACG CTCGGTCCGT TGACGACGGA CATCGCCCCT GGCTACGACC ACATCACCAG CGCCATCGGC GCGGCGATGA TCGGCTGGTT CGGCACGGCC ATGCTCTGCT ACGTGACGCC CAAGGAGCAC CTGGGCCTGC CCGACCGCGA CGACGTCAAG ACCGGCGTCA TCACCTACAA GCTGGCCGCC CACGCCGCCG ACCTGGCCAA GGGTCACCCC GGCGCGGCCA TGTGGGACGA CGCCATCAGC CGGGCGCGGT TCGAGTTCCG CTGGGAGGAC CAGTTCAACC TGGGCCTCGA CCCCGAGACC GCCCGGGCCT TCCACGACGA GACCCTGCCC AAGGAGGCGC ACAAGACCGC GCACTTCTGC TCGATGTGCG GTCCCAAGTT CTGCTCGATG AAGATCAGCC AGGAAGTCCG CGAATTCGCG GCCGGCATGG CCCCCAACTC CATCGAACAG GGCATGGCGG AGATGAGCGA CAAGTTCCGC GAACAGGGTT CGGAAATCTA TCTGAAGACG GAATAG
|
Protein sequence | MNVQLPIKDA IGAIPTGERP GSRKVYQAGS LFPDIRVPFR EVAVHPSANE PPVTIYDPSG PYTDPHAKID IEQGLERSRE PWIIARGDCE LVATPREVKP EDNGFAQGKH LAPQFTAKRP IFKGAQGKLV TQLEYARAGI VTAEMEYVAI RENLRREQDR PCVRDGEDFG ASIPDFVTPE FVRQEVARGR AIIPANINHG ELEPMAIGRN FLVKINANIG NSAVLSTVAD EVDKLVWATR WGADTVMDLS TGRNIHNIRD WIIRNSPVPI GTVPIYQALE KVNGVAEDLN WEVFRDTLIE QAEQGVDYFT IHAGVRLPFI PLTAKRVTGI VSRGGSIMAK WCLAHHKENF LYERFEDICE IMRSYDVSFS LGDGLRPGST ADANDEAQFA ELRTLGELTK VAWKHGVQVM IEGPGHVAMH KIKANMDEQL KHCHEAPFYT LGPLTTDIAP GYDHITSAIG AAMIGWFGTA MLCYVTPKEH LGLPDRDDVK TGVITYKLAA HAADLAKGHP GAAMWDDAIS RARFEFRWED QFNLGLDPET ARAFHDETLP KEAHKTAHFC SMCGPKFCSM KISQEVREFA AGMAPNSIEQ GMAEMSDKFR EQGSEIYLKT E
|
| |