Gene Caul_4353 details


Gene Information

Locus tag: Caul_4353
Symbol:
ID: 5901814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4729064
End bp: 4730605
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 74%
IMG OID: 641564871
Product: hypothetical protein
Protein accession: YP_001685971
Protein GI: 167648308
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
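
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the CDS ends with a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4729064
END_BP = 4730605


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1542
print(protein_length(length))  # 513
```

Both results agree with the Gene Length (1542 bp) and Protein Length (513 aa) fields listed for this gene.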


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.92083
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGCG CGGACGTCCT GATCACCACC CTGGCCGACA ACGGCGTCAC CGCCTGCTTC 
GCCAATCCCG GCACCAGCGA GATGCAGTTC GTCGCGGCCC TGGACCGCGA GCCGCGCATG
CGCTCGGTGC TGTGCCTGTT CGAGGGGGTG GCGACCGGGG CGGCCGACGG CTGGGGGCGG
ATGACCGGGA CGCCGGCCTG CACCCTGCTG CACCTGGGTC CCGGCTACGC CAACGGCGCG
GCCAACCTGC ACAACGCCCG GCGGGCCTTC ACGCCCACGG TCAATGTCAT CGGCGACCAC
GCGACCGATC ACCGGAAGTA CGACGCCCCG CTCAATTCCG ACATCGCCGC CCTGGCCGCG
CCCAACGCCC TGTGGGTCAA GTCGGCCGAC AGCGCCGACG CCGTCGGACC CCTCGCCGCC
GAGGCCGTGG CCGCCAGCTT CGGCGCGCCC GGCGGAAACG CCTGCCTGAT CCTGCCCGCC
GACAGCGCCT GGAACGAGAC CAACACCGCC GGGCCGGTGA TCGAGCGCCC CCTTCCGCGC
GCTCCGAACC CGGCCTCGGT CGCTGCGGTC GCCAAGGCGC TGAAGGCCGC CACCAAGCCG
GTGCTGCTGT TGGGGAGCGG CGCGTGCGGC GAGGCGGCCA TCGCCGCCGC CGGGCGGATC
GCGGCGACCG GCGTGCGGGT GCTGACCGAC ACCTTCACCG CCCGCCAGGC GCGGGGCGAA
GGACGTTTCC GACCCGACAA GCTGCCCTAT TTCGGCGAGC AGGCCCTGGC CGACCTCGAC
GGCGTCGACC TGATGGTGCT GGTCGCCACG ACCGAGCCGG TGGCGTTCTT CGCCTATCCC
GACCGGCCCA GCGTGCTGGT CCCGAACGGC TGCGCGACGC TGGAGCTCAG CGACCGGGGA
ACCGACGCGG CCGCCGCGCT GGAGGCCCTG GCCGAGATCC TCGGCGCGCC GGCGGTCGGG
GCGATCGAGC CCTATGCCCC GGCCGCCGCC CCGTCCGGCA AGCTGGACGC CTGGGCCATC
GGCGCGGCCA TCGCTCGCCA CATGCCGGCG GGCGCGATCA TCTCGGACGA CGCGGTGACC
GCCGGCCTGC CGATCTTCAC CCAGACCCGC AACGCCCGGG CTCACGACTG GCTGTCCTTG
ACCGGCGGCG CGATCGGCCA GGGCGTCCCG CTGGCCATCG GCGCGGCTGT CGCCTGTCCC
GACCGCAAGG TCCTGGCCCT GACCGGCGAC GGGGCGGGGA TGTACACGGT GCAGGGCCTG
TGGACGATCG CCCGCGAGAA GCTGGACATC ACCGTGGTGG TGTTCGCCAA CCACGCCTAC
CGCATCCTCG GCATCGAGAT GGGCCGCACC GGCGGCGGCG AACCGGGTCC GGCGGCCTCG
CGCCTGCTGG ACCTCTCCGA TCCCCGCATC GACTGGGTCG CGGTGGCCGG CGGGCTGGGC
GTGCCGGCGC AACGGGTCGA GACGGCGGAG GCGTTCGACG AGGTCATGGC TCGGGCGATG
CGCGAGCCGG GGCCGCGGTT CATCGAGGCG GCGATACGGT AG
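
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The sketch below assumes the sequence is pasted in the same layout as above (blocks of 10 bases separated by spaces and line breaks); `gc_fraction` is a hypothetical helper name, and how the source database rounds its percentage is not stated here.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above.
first_block = "ATGAACGGCG"
print(gc_fraction(first_block))  # 0.6
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.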
 
Protein sequence
MNGADVLITT LADNGVTACF ANPGTSEMQF VAALDREPRM RSVLCLFEGV ATGAADGWGR 
MTGTPACTLL HLGPGYANGA ANLHNARRAF TPTVNVIGDH ATDHRKYDAP LNSDIAALAA
PNALWVKSAD SADAVGPLAA EAVAASFGAP GGNACLILPA DSAWNETNTA GPVIERPLPR
APNPASVAAV AKALKAATKP VLLLGSGACG EAAIAAAGRI AATGVRVLTD TFTARQARGE
GRFRPDKLPY FGEQALADLD GVDLMVLVAT TEPVAFFAYP DRPSVLVPNG CATLELSDRG
TDAAAALEAL AEILGAPAVG AIEPYAPAAA PSGKLDAWAI GAAIARHMPA GAIISDDAVT
AGLPIFTQTR NARAHDWLSL TGGAIGQGVP LAIGAAVACP DRKVLALTGD GAGMYTVQGL
WTIAREKLDI TVVVFANHAY RILGIEMGRT GGGEPGPAAS RLLDLSDPRI DWVAVAGGLG
VPAQRVETAE AFDEVMARAM REPGPRFIEA AIR
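
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons; this gene starts with ATG, so that case is not handled). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace between blocks is ignored. Alternative start codons
    (for example GTG or TTG) are NOT converted to M in this sketch.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGAACGGCG CGGACGTCCT GATCACCACC"))  # MNGADVLITT
print(translate("GCGATACGGT AG"))                     # AIR
```

The two outputs match the first ten and last three residues of the protein sequence above.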