Gene Caul_0423 details


Gene Information

Locus tag: Caul_0423
Symbol:
ID: 5897697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 462994
End bp: 464523
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 67%
IMG OID: 641560909
Product: long-chain-fatty-acid--CoA ligase
Protein accession: YP_001682058
Protein GI: 167644395
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
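
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon. The record does not state either point, but its numbers fit both. The function name `cds_lengths` is made up for this illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1       # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1        # the stop codon encodes no residue
    return gene_len, protein_len


print(cds_lengths(462994, 464523))  # (1530, 509)
```

The result matches the Gene Length (1530 bp) and Protein Length (509 aa) fields of the record.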


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGAAA TCATCTCCGG CGAGCGACGT CTTCCCTACG ACGTCTTGGA AGAACACGTC 
GCCGCGGTCG CCAGCCGCTT AAGCGAACGG GGCGTTCGAG CCGGCGAGGC GGTGGTGCTG
CTGCTGCGCA ACGATCTGGC CTTCTTCGAG GCCGCCTTGG GCGCGATCCG CATCGGGGCC
TATGCGACCC CCGTCAACTG GCACGCCTCC AGCGAGGAGC TGGCCTTCAT CCTGCAAGAC
AGCGCCGCCA AGGTGCTGAT CGCTCACGTC GACCTGTTTA ACGCCGTCGC CGACGACCTG
CCCAGCCACG TCGAGGTCGT GCTTGTCGAG ACGCCCCCGG AGCTGGCGGC CGCCTATCGG
GTGGCCGAGG CCGACCGGTT CCCCCGCGCC GGCCATCAAA CTTGGCGCGA GTGGCTGTCG
GCCGGCGCGG CCCAGGCGAC GCCGGTGAGC GCTCAGACCA CGGCGATGAT CTACACCTCG
GGCACGACGG GCCGCCCCAA GGGCGTGCGT CGCACGCCGC CGACGGCCGA GCAGACCTTG
GTGCAAATCC AAAACGCCAT CCGCAACTTC GGCCTGGGGG AGCTGGGAAC CGTCGTGCTG
ATGAACGGGC CAATGTACCA CACCGCCCCC AACGGCTACG GCATGATGGC CGCCCGTTTT
GGCCACACGA TCGTCCTGGA ACCCCGGTTT GACGCCGAGG AAATGCTGCA GCTGATCGAG
CGCCATCGGG TCACGCACAT GCATGTCGTG CCGACCATGT TCGTGCGCCT ATTGCGCCTG
CCCGCGGCGG TGCGCGAGCG CTACGACCTG TCGTCCTTGC GTTTCGTCGT GCACGGCGCC
GCGCCCTGCC CGGTCGAGGT CAAGCAACAG ATGATCGCTT GGTGGGGGCC GGTCATCAAC
GAGTACTACG GCTCCACGGA GACCGGCATC GTCGCCTGGC ATGACGCCGA GCAAGCCTTG
AGCCGGCCCG GAACGGTCGG CCAGGTTTGC CCCGGCGCGG TGGTGAAGGC CTTCGACGAG
GACGGCCGGC CCCTGGGCCC GGGCGAGGTC GGCGATCTCT ATATGCGCTC GGCCGGCATG
ACCGACTTCA CCTATCACGG CCGCGACGAC GAGCGCGCCG CCGTCGGACG CGAGGACCTG
ATCTGCGTGG GCGACATCGG CTGGGTCGAT GCGGACGGCT ATGTCTTTCT GTGTGATCGC
CGCAAGGACA TGATTATTTC CGGCGGGGTC AACATCTATC CCGCCGAGAT CGAGGCGGTC
TTGATCGGCC TGGAAGGCGT GCGCGATTGC GCCGTTTTCG GCATCCCAGA TTCCGAATTT
GGCGAAGCGG TCTGCGCCCA TATCGAGGTG GAGCCCCTGG GCGCGCCGAG CTTGGACGTC
GTGCGGTCTC ACCTGGCCGC GCGCCTGGCC AAGTTCAAGG TCCCCAAGGT GATCGAGTTC
GCCCACGCCT TGCCGCGCGA GGATTCCGGC AAGATCTTCA AGAAGCGCCT GCGCGAGCCC
TATTGGGAAG GGCTGGGACG CAAAATCTGA
 
Protein sequence
MSEIISGERR LPYDVLEEHV AAVASRLSER GVRAGEAVVL LLRNDLAFFE AALGAIRIGA 
YATPVNWHAS SEELAFILQD SAAKVLIAHV DLFNAVADDL PSHVEVVLVE TPPELAAAYR
VAEADRFPRA GHQTWREWLS AGAAQATPVS AQTTAMIYTS GTTGRPKGVR RTPPTAEQTL
VQIQNAIRNF GLGELGTVVL MNGPMYHTAP NGYGMMAARF GHTIVLEPRF DAEEMLQLIE
RHRVTHMHVV PTMFVRLLRL PAAVRERYDL SSLRFVVHGA APCPVEVKQQ MIAWWGPVIN
EYYGSTETGI VAWHDAEQAL SRPGTVGQVC PGAVVKAFDE DGRPLGPGEV GDLYMRSAGM
TDFTYHGRDD ERAAVGREDL ICVGDIGWVD ADGYVFLCDR RKDMIISGGV NIYPAEIEAV
LIGLEGVRDC AVFGIPDSEF GEAVCAHIEV EPLGAPSLDV VRSHLAARLA KFKVPKVIEF
AHALPREDSG KIFKKRLREP YWEGLGRKI
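
The gene and protein sequences are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that whitespace, translates codons to amino acids, and computes a GC fraction. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are allowed, which does not matter for this ATG-initiated gene). The helper names `clean`, `gc_fraction` and `translate` are made up for this illustration, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Codon table built in TCAG order; table 11 shares these assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGTCAGAAA TCATCTCCGG CGAGCGACGT CTTCCCTACG ACGTCTTGGA AGAACACGTC"
last_line = "TATTGGGAAG GGCTGGGACG CAAAATCTGA"

print(translate(first_line))  # MSEIISGERRLPYDVLEEHV
print(translate(last_line))   # YWEGLGRKI
```

Both outputs agree with the start and end of the protein sequence shown above. Passing the whole gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.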