Gene Caul_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3003 
Symbol 
ID5900458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3269067 
End bp3270605 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content73% 
IMG OID641563500 
Producthypothetical protein 
Protein accessionYP_001684628 
Protein GI167646965 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGCG CCGACGCCTT GATCACCACC CTGGCCGACA ACGGCGTCAC CGCCTGCTTC 
GCCAATCCCG GCACCAGCGA AATGCAATTC GTCGCCGCGC TGGACCGCGA GCCGCGCATG
CGCTCGGTGC TGTGCCTGTT CGAGGGCGTG GCCACCGGGG CCGCCGACGG CTACGGCCGC
ATGGCCGGCA AGCCGGCCTG CACTCTGCTG CATCTCGGTC CGGGCTACGC CAACGGCGCG
GCCAATCTGC ACAATGCCCG CCGCGCCTTC ACGCCGGTGG TCAACATCAT CGGCGACCAC
GCCACCTATC ACCGCGGCCA TGACGCCCCG CTGAACTCCG ACATCGCCGC CCTGGCCGCG
CCCAATTCGA TCTGGGTGAA GTCGGCCGAG ACCGCCGACG CGGTGGGTCC CCTGGCGGCC
GAAGCGGTGG CCGCCAGCCA AGGTCCGCCC GGCGGCGTCG CCTGCCTGAT CCTGCCCGCC
GATAGCGCCT GGGACGAGAC GACGGTGACG GGGCCGGTGG CGGCTGCGAC GCCCCGGCGC
GCCCCGGACC TGGCCGAGGT CGAGGCGCTG GCCGCCCGAT TGAAGGCGGC GCGCAAGCCG
GTGCTGCTGA TCGGCGGCGC GGCCTGCGGC GAGACCGGCC TGGCGGCGGC CGGCCGCATC
GCCGCCGCCG GCGTTCGCGT GCTCACCGAC ACCTTCGTGG CGCGGATGGC GCGCGGCGCG
GGCCGGTTCG CGCCCGATCG CATGCACTAT TTCGCCGAGA TGGCCCTGGG CGACCTGGAG
GGCGTCGACC TGATGGTCCT CGTCGGGACA GCCCCTCCGG TGGCCTTCTT CGCCTATCCC
GACCGCCCCA GCGTGCTAGT TCCAGACGGC TGCGAGGTCG TCGCCCTGGG CGACCGAGCG
GCGGACGGGT CGGCCATCCT GCACGCCCTG GCGCAGGCCC TGGGCGCGCC GGCCCGGGGG
CCGGTCGAGT CGCGCGTCCG CCCCGCGGCG CCGCAAGGGC CGCTGACCCC CGCCGCCATC
GGCCAGGCGA TCACCCGCCA TCTGCCCGAC CAAGCCATCG TCAGCGACGA CGCGGTCACC
TGCGGTCTGC CGATCTTCCT GGCCACGCAG ACGGCGGAGC CTCATGACTG GCTGATGCTG
ACCGGCGGCG CCATCGGTCA GGGGATTCCC CTGGCGATCG GCGCGGCCGT CGCCTGTCCC
GATCGCAAGG TCGTGGCCCT GAGCGGCGAC GGCGCGGCGA TGTTCACCGT CCAGGGCTTG
TGGACCATCG CCCGCGAGCA ACTCGACGTC ACGGTGGTGG TGTTCGCCAA CCACGCCTAC
CGCATTCTCG ACATCGAAAT GTATCGCATG AGCGGCGGCC CGGCGGGGCC AACGGCGCAA
CGATTGCTGG ACCTGGGCGC CCCGCGCATC GACTGGCTGA ACCTGGCCCG GTCGCTGGGA
ATGGCGTCGG TTCGCGCCGA CAGCGCCGAA GCGTTCGACG TCGCGTTCGA GACCGCCATG
AGCCGCCCCG GGCCGACCTT CATAGAGGCC GCGCTGTAG
 
Protein sequence
MNGADALITT LADNGVTACF ANPGTSEMQF VAALDREPRM RSVLCLFEGV ATGAADGYGR 
MAGKPACTLL HLGPGYANGA ANLHNARRAF TPVVNIIGDH ATYHRGHDAP LNSDIAALAA
PNSIWVKSAE TADAVGPLAA EAVAASQGPP GGVACLILPA DSAWDETTVT GPVAAATPRR
APDLAEVEAL AARLKAARKP VLLIGGAACG ETGLAAAGRI AAAGVRVLTD TFVARMARGA
GRFAPDRMHY FAEMALGDLE GVDLMVLVGT APPVAFFAYP DRPSVLVPDG CEVVALGDRA
ADGSAILHAL AQALGAPARG PVESRVRPAA PQGPLTPAAI GQAITRHLPD QAIVSDDAVT
CGLPIFLATQ TAEPHDWLML TGGAIGQGIP LAIGAAVACP DRKVVALSGD GAAMFTVQGL
WTIAREQLDV TVVVFANHAY RILDIEMYRM SGGPAGPTAQ RLLDLGAPRI DWLNLARSLG
MASVRADSAE AFDVAFETAM SRPGPTFIEA AL