Gene Caul_0212 details


Gene Information

Locus tag: Caul_0212
Symbol:
ID: 5897486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 226196
End bp: 227815
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 69%
IMG OID: 641560696
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001681847
Protein GI: 167644184
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
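
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon
    and is therefore not counted in the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(226196, 227815)
print(length_bp)                  # 1620
print(protein_length(length_bp))  # 539
```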


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.240297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCTGG GCCTGATGCA GACCACGCCC TTGCTGGTCA GCGGCATCCT GCGCTACGCG 
GCCGCCGCCC ACGGCGGACG CGAGATCGTG TCGCGGCTGA TCGACGAGCC CGTCTGGCGC
TACGACTATG CCGGCCTGTC GCGCCGTTCG GCCCAGGCGG CCAACGCCCT GGCGCGGCTG
GGCGTGACCT CCGGCGACAG GGTCACGTCC CTGGCCTGGA ACACGCACCG GCACCTGGAG
CTGTTCTACG CGGTCCCTGG CCTGGGCGCG GTGCTGCACA CCGCCAATCC CCGGCTGTCG
GACGAGCAGA TCGTCTTCAC GATCAACCAC GCGGCCAGCG GCGTCCTGCT GTTCGATCGC
AATTTCGCCG AGCTGGTCGC CCGCCTCGCG CCGCGCCTGA CCACGGTGAA GACCTTCGTG
ATGCTGTCGG ACGCCGAGCG AACCCATGAC GCCGGCGTCA GGGCGAGGTC GTACGAGACC
CTGATCGCCG GTGAGGCCGA GACCTTCGAC TGGCCCAGCT TCGACGAGAA CGCCGGGGCC
TTCCTCTGCT ACACCTCGGG CACGACGGGC GATCCCAAGG GGGTGCTCTA TTCGCACCGC
GCCGTGGTGC TGCACGCCAT GGCCGGCGGT CTGGCCAGCG CCTTTGGCCT GACGGCCTTC
GACGTGGTGA TGCCGTGCTC CAGCCTCTAC CACGCCACGG CCTGGGGCCT GCCGTTCACC
GCCCCGATCT GCGGCTCCAA GCTGGTCCTG CCCGCCGACA AGATGGATGG GGCCTCACTA
CACCAGCTGA TCCAGGACGA GGGCGTCACC TTCACCGGCG GTGTGCCGAC CATCTGGACG
ATGTATCTCT CCTGGCTGGA GCAGACCGGC CAGCGGCCGG ACACTCTACG CAGGGTCGTG
ATCGGCGGCA GCGCCGTGCC CCGCGCCATG GCCGCGACCT TCAAGACGAA GTATGGCGTC
GACGTACTGC AGATCTGGGG CATGACCGAG ACCTGCCCGA TCGGCGTGGT CGCCACCCCG
ACCCCATCCC TGGCCGCCCT GGGCGACGAG GCGATGAGCG ACGCCATCTG GACCCGCCAG
GGACGGCTGC AATTCGGCAT CGAGCTGAAG GTCGAGAACG AGGACGGCTC GGAGGCCCCG
CGCGACGGCG AGACGTCCGG AGCCCTGAAG GTGCGCGGAC CCTGGGTGGT GCGGCGCTAC
TACCGCCAGG AGGCCGACGT CGCCGACGCC GACGGCTGGT TCGACACCGG CGACATCGCC
ACCCTCGACG AACACGGCTT CATGCGGATC ACCGATCGCC AGAAGGATGT GATCAAGTCG
GGCGGCGAGT GGATCAGCTC GATCGATCTG GAGAACATCG CCGCCGGCTG CCCGGGCGTG
AAGATCGCCG CCGTGGTCGG CGTGCCCCAC CCGAAGTGGG AGGAGCGGCC GCTGCTGGTC
ATCGAGGTCC ACGAGGGCTC GGTGGTCTGC AAGGCGGAGG TGCTCGCCTA CCTGGGATCG
CGGATCGTCA AGTGGTGGAC GCCCGACGAC GTAGTGTTCG CGGCGGTGCC GCTGACGGCG
ACGGGCAAGA TCGACAAGAA GGTGCTGCGC GAGGTATGGC GGGGGCATTT GATGGGGTAG
 
Protein sequence
MILGLMQTTP LLVSGILRYA AAAHGGREIV SRLIDEPVWR YDYAGLSRRS AQAANALARL 
GVTSGDRVTS LAWNTHRHLE LFYAVPGLGA VLHTANPRLS DEQIVFTINH AASGVLLFDR
NFAELVARLA PRLTTVKTFV MLSDAERTHD AGVRARSYET LIAGEAETFD WPSFDENAGA
FLCYTSGTTG DPKGVLYSHR AVVLHAMAGG LASAFGLTAF DVVMPCSSLY HATAWGLPFT
APICGSKLVL PADKMDGASL HQLIQDEGVT FTGGVPTIWT MYLSWLEQTG QRPDTLRRVV
IGGSAVPRAM AATFKTKYGV DVLQIWGMTE TCPIGVVATP TPSLAALGDE AMSDAIWTRQ
GRLQFGIELK VENEDGSEAP RDGETSGALK VRGPWVVRRY YRQEADVADA DGWFDTGDIA
TLDEHGFMRI TDRQKDVIKS GGEWISSIDL ENIAAGCPGV KIAAVVGVPH PKWEERPLLV
IEVHEGSVVC KAEVLAYLGS RIVKWWTPDD VVFAAVPLTA TGKIDKKVLR EVWRGHLMG
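
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal translation sketch. It assumes the elongation codon assignments of NCBI translation table 11 (the table named in this record), strips the 10-base spacing used in the listing, and stops at the first stop codon. The names `CODON_TABLE`, `clean`, and `translate` are illustrative. Table 11 also permits alternative start codons, which this sketch does not handle; this gene begins with ATG, so that does not matter here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI table 11 elongation code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATCCTGG GCCTGATGCA GACCACGCCC"))  # MILGLMQTTP
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to check the two listings against each other.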