Gene Caul_3175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3175 
Symbol 
ID5900630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3439154 
End bp3440989 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content66% 
IMG OID641563679 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001684800 
Protein GI167647137 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTTC AACTGCCCAT CAAAGACGCC ATCGGCGCGA TCCCGACCGG GGAGCGCCCC 
GGTTCGCGCA AGGTCTATCA GGCCGGCTCG CTGTTCCCCG ACATCCGGGT GCCGTTCCGT
GAGGTCGCTG TCCATCCCAG CGCCAACGAA CCGCCGGTCA CCATCTATGA CCCGTCCGGC
CCCTATACCG ACCCGCACGC CAAGATCGAC ATCGAGCAGG GCCTGGAGCG GTCGCGCGAG
CCGTGGATCA TCGCCCGCGG CGACTGCGAG TTGGTGGCCA CCCCCCGGGA GGTGAAGCCC
GAGGACAACG GCTTCGCCCA GGGCAAGCAC CTGGCGCCCC AGTTCACCGC CAAGCGCCCG
ATCTTCAAGG GCGCGCAAGG CAAGCTGGTC ACCCAGCTGG AATACGCCCG CGCCGGCATC
GTCACCGCCG AGATGGAATA TGTGGCCATC CGCGAGAACC TGCGCCGCGA GCAGGACCGC
CCGTGCGTGC GCGACGGCGA GGACTTCGGC GCCTCGATCC CCGACTTCGT GACCCCCGAA
TTCGTCCGCC AGGAAGTGGC GCGCGGCCGG GCCATCATTC CGGCCAACAT CAACCACGGC
GAGCTGGAGC CGATGGCGAT CGGCCGCAAT TTCCTGGTCA AGATCAACGC CAACATCGGC
AACAGCGCCG TGCTTTCCAC CGTGGCCGAC GAGGTCGACA AGCTGGTGTG GGCCACGCGC
TGGGGCGCCG ACACGGTCAT GGACCTGTCG ACCGGCCGCA ACATCCACAA CATCCGCGAC
TGGATCATCC GCAACAGCCC GGTGCCGATC GGCACGGTGC CGATCTACCA GGCGCTGGAG
AAGGTCAACG GCGTGGCCGA GGACCTGAAC TGGGAAGTCT TCCGCGACAC CCTGATCGAG
CAGGCCGAGC AGGGGGTGGA CTATTTCACC ATCCACGCCG GCGTCCGTCT TCCGTTCATC
CCGCTGACCG CCAAGCGGGT GACGGGCATC GTCTCGCGCG GCGGCTCGAT CATGGCCAAG
TGGTGCCTGG CACACCACAA GGAGAACTTC CTCTACGAGC GCTTCGAGGA CATCTGCGAG
ATCATGCGCA GCTACGACGT GTCGTTCTCG CTGGGCGACG GCCTGCGTCC GGGCTCGACG
GCCGACGCCA ATGACGAGGC CCAATTCGCC GAGCTGCGCA CCCTGGGCGA GCTGACCAAG
GTGGCCTGGA AGCACGGCGT GCAGGTGATG ATCGAAGGGC CGGGCCACGT CGCCATGCAC
AAGATCAAGG CCAACATGGA CGAGCAGCTC AAGCACTGCC ACGAGGCCCC CTTCTACACG
CTCGGTCCGT TGACGACGGA CATCGCCCCT GGCTACGACC ACATCACCAG CGCCATCGGC
GCGGCGATGA TCGGCTGGTT CGGCACGGCC ATGCTCTGCT ACGTGACGCC CAAGGAGCAC
CTGGGCCTGC CCGACCGCGA CGACGTCAAG ACCGGCGTCA TCACCTACAA GCTGGCCGCC
CACGCCGCCG ACCTGGCCAA GGGTCACCCC GGCGCGGCCA TGTGGGACGA CGCCATCAGC
CGGGCGCGGT TCGAGTTCCG CTGGGAGGAC CAGTTCAACC TGGGCCTCGA CCCCGAGACC
GCCCGGGCCT TCCACGACGA GACCCTGCCC AAGGAGGCGC ACAAGACCGC GCACTTCTGC
TCGATGTGCG GTCCCAAGTT CTGCTCGATG AAGATCAGCC AGGAAGTCCG CGAATTCGCG
GCCGGCATGG CCCCCAACTC CATCGAACAG GGCATGGCGG AGATGAGCGA CAAGTTCCGC
GAACAGGGTT CGGAAATCTA TCTGAAGACG GAATAG
 
Protein sequence
MNVQLPIKDA IGAIPTGERP GSRKVYQAGS LFPDIRVPFR EVAVHPSANE PPVTIYDPSG 
PYTDPHAKID IEQGLERSRE PWIIARGDCE LVATPREVKP EDNGFAQGKH LAPQFTAKRP
IFKGAQGKLV TQLEYARAGI VTAEMEYVAI RENLRREQDR PCVRDGEDFG ASIPDFVTPE
FVRQEVARGR AIIPANINHG ELEPMAIGRN FLVKINANIG NSAVLSTVAD EVDKLVWATR
WGADTVMDLS TGRNIHNIRD WIIRNSPVPI GTVPIYQALE KVNGVAEDLN WEVFRDTLIE
QAEQGVDYFT IHAGVRLPFI PLTAKRVTGI VSRGGSIMAK WCLAHHKENF LYERFEDICE
IMRSYDVSFS LGDGLRPGST ADANDEAQFA ELRTLGELTK VAWKHGVQVM IEGPGHVAMH
KIKANMDEQL KHCHEAPFYT LGPLTTDIAP GYDHITSAIG AAMIGWFGTA MLCYVTPKEH
LGLPDRDDVK TGVITYKLAA HAADLAKGHP GAAMWDDAIS RARFEFRWED QFNLGLDPET
ARAFHDETLP KEAHKTAHFC SMCGPKFCSM KISQEVREFA AGMAPNSIEQ GMAEMSDKFR
EQGSEIYLKT E