Gene Caul_3991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3991 
Symbol 
ID5901453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4321004 
End bp4322029 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content75% 
IMG OID641564512 
ProductApbE family lipoprotein 
Protein accessionYP_001685614 
Protein GI167647951 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.604582 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCG TCCTCGTCCC CCAGCTCGCC GAGCCGCCCG CCCGCCCGAT CGGCGGTGCG 
GTGCTGGCGC TCGCCGGCCA GACGATGGGC ACGACCTGGT CGGTCAAGCT GGTGGCGCCG
CCGACGGCCA ACGCCGAGGC CCTGACGGCC ATGGCCCAGC GTGAGCTCGA CGCCGTGGTC
CGCGAGATGA GCCCGTGGGA GCCGGAGTCC GATCTCTCCC GCTACAACCG CGCCGCCGCC
GGGAGCTGGA CCGCCCTGCC CCCGGCCTTC GCCCAGGTGC TGCGCTGCGC CCTGGAGATC
GCCGAGGCGA CCGACGGAGC CTTCGATCCG ACGCTGGGCG GCCTGGTCGA CCTCTGGGGT
TTCGGCCCCC GCCCCTTCTC CGGCGCGCCG CCGCGAGCCC GAGACATCGC CATCGCTCGC
GAGACCGCCG GCTGGCGCCG CCTGGTCCTC GACGGCGACA GCCTGTTGCA ACCCGGCGGC
CTGCGCCTGG ACCTCAATGG CGTCGCCAAG GGCTTCGCGG TCGATCAGGT CGCCGCCGCC
CTGGGCCGGG CCGGCGCGCG CTCGTACCTG GTCGAGGTGG GCGGCGAGCT GCGCGGGACC
GGCGCCAAGC CCGACGGCCA ACCCTGGTGG GTCGAGCTGG AACGCCCGCC GGCCGCGCCC
GCGCGCGGAT GCGCGCCTCT TCCGCTAGTT GATGACGGCC CGCGCACCCT GGTCGCCCTG
CACGATCTGT CGGCCGCCAC CTCGGGCGAC TACCGCCGGT TCTTCGAGCA CGACGGCCGT
CGCTACGCCC ACACCCTGGA CCCCGCCACG GCCGCGCCGG TCACCCATTC GACGGTCAGC
GTCACCGTGC TCGACCAGAG CTGCATGCGC GCCGACGCCT ACGCCACCGC CCTGACCGTG
ATGGCGCCCG ACGCCGCCCT GGCCTTCGCC GCCGCCCATG GCCTGGCCGC CCTGATCCTC
GCCAACGGCG CGCACGGCCT GGAGGAGCGC CTGTCGCCGG CCCTTGAGGC GATGCTCGAC
GCATGA
 
Protein sequence
MTRVLVPQLA EPPARPIGGA VLALAGQTMG TTWSVKLVAP PTANAEALTA MAQRELDAVV 
REMSPWEPES DLSRYNRAAA GSWTALPPAF AQVLRCALEI AEATDGAFDP TLGGLVDLWG
FGPRPFSGAP PRARDIAIAR ETAGWRRLVL DGDSLLQPGG LRLDLNGVAK GFAVDQVAAA
LGRAGARSYL VEVGGELRGT GAKPDGQPWW VELERPPAAP ARGCAPLPLV DDGPRTLVAL
HDLSAATSGD YRRFFEHDGR RYAHTLDPAT AAPVTHSTVS VTVLDQSCMR ADAYATALTV
MAPDAALAFA AAHGLAALIL ANGAHGLEER LSPALEAMLD A