Gene Caul_1902 details

Gene Information

Locus tag: Caul_1902
Symbol:
ID: 5899357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2040245
End bp: 2041435
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 66%
IMG OID: 641562392
Product: hypothetical protein
Protein accession: YP_001683529
Protein GI: 167645866
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
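
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2040245
END_BP = 2041435


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1191 396
```

Under those assumptions the result agrees with the listed gene length (1191 bp) and protein length (396 aa).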


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.986081
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGCGC GCATCCTGGA AAAAGATGTC GCCGTCGTCG GCGTGGGCCA GTCAGAGGTC 
GGCCGTCCGT CCTCGCGCTC GGCCATGCAA CTGACCGTGG ACGCGGCCCT GGAAGCCATC
GCCGACGCCG GCCTCAACCC CAAGGACATC GACGGCGTCT GCTCCTGGCC CGGCGACAAC
ACCAACGGCT CCAGCTTCTC ACCAGTCGGG CCATTGGCGG TGCTGAGCAT GTTCGGCCTC
AACTGCAATT GGTTCAGCGG GGGCTACGAG GCGGCGGGGC CGCTGGCTGG CCTGATGAAC
GGCGCGATGG CCATCGCCTC GGGCATGGCG AAAAACGTGC TGGTCTTCCG CACGATCACC
GAGTCCAGCA ACCGCCTCAC CGGCAGCAAG GACCAGAGCC TCGCCGCCAA GTCCGGCCCG
CGCGACGGTA ATTTCATGTG GCAGTGGTGC ACGCCGTTCA ACGTGCTGTC GGTGGTCAAC
ATCACCGCCA TGTACGCTCG CAGCCATATG GAAAAGTACG GCACGACGCC GGAACAGTTG
GCCCAGATCG CCCTGAACGC CCGCCGCAAC GCCTTGCTCA ATCCCAAGGC CGTCATGCGC
AAGCCCATGA CGATGGACGA CTATTTCGCG TCGAAGATGA TCTCGACGCC GTTGCGGATG
TTCGACTGCG ACGTCCATTG CGACGCATCG ACCGCCATCG TCCTGTCACG CAAGGACATC
GCCATGGATC TGCGCAATCC GCCGATCCGC ATCGAGGCGA TCGGGGCGGC GATGAACCAG
CCCTATCTGT GGGACCAGGT CGACCTGACC GCCAACGCCA CGCCGGACGC GGCCAACGCC
ATGTGGGCGC GCACCGATTT CAAGCCCGCC GATGTGGACA CGGCCCAGCT CTACGACGGA
TTCAGCATCC TGACGATGAT GTGGCTGGAA GGCCTTGGCC TGTGCCCGAA GGGCGGAAGC
GGGGCCTTTG TCGAGGGCGG TCATCGGATC GCGCTGGACG GCGAACTGCC CTTGAACACC
AACGGCGGCC AGCTATCGGG CGGACGCACA CACGGACTGG GCTACGTCCA CGAGGCTTGC
ACCCAGCTTT GGGGGCGTGG CGGCGAGCGC CAGATCCGCG ACCCGCATGT GGCGGTCTGC
GCCGCCGGTG GCGGGCCGCT CGCGGGCAGC CTGTTGCTGG TCAAGGACTG A
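
The GC content field can be recomputed from a sequence laid out in 10-base groups like the one above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper name. The sample input is only the first row of the gene sequence, so its value need not equal the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a nucleotide string; whitespace is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First row of the gene sequence, copied with its 10-base grouping intact.
first_row = "ATGGGCGCGC GCATCCTGGA AAAAGATGTC GCCGTCGTCG GCGTGGGCCA GTCAGAGGTC"
print(f"{gc_content(first_row):.0%}")
```

Passing the full gene sequence instead of `first_row` gives the value to compare against the GC content field.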
 
Protein sequence
MGARILEKDV AVVGVGQSEV GRPSSRSAMQ LTVDAALEAI ADAGLNPKDI DGVCSWPGDN 
TNGSSFSPVG PLAVLSMFGL NCNWFSGGYE AAGPLAGLMN GAMAIASGMA KNVLVFRTIT
ESSNRLTGSK DQSLAAKSGP RDGNFMWQWC TPFNVLSVVN ITAMYARSHM EKYGTTPEQL
AQIALNARRN ALLNPKAVMR KPMTMDDYFA SKMISTPLRM FDCDVHCDAS TAIVLSRKDI
AMDLRNPPIR IEAIGAAMNQ PYLWDQVDLT ANATPDAANA MWARTDFKPA DVDTAQLYDG
FSILTMMWLE GLGLCPKGGS GAFVEGGHRI ALDGELPLNT NGGQLSGGRT HGLGYVHEAC
TQLWGRGGER QIRDPHVAVC AAGGGPLAGS LLLVKD
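
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons; this gene begins with ATG). The `translate` function and table construction are illustrative, not taken from the source database.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments of the standard
# code, which translation table 11 also uses for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGGCGCGC GCATCCTGGA AAAAGATGTC"))  # MGARILEKDV
```

The output matches the first ten residues of the protein sequence; the last four codons of the gene (GTC AAG GAC TGA) likewise give V, K, D and a stop, consistent with the protein ending in VKD.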