Gene Caul_0579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0579 
Symbol 
ID5898034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp630437 
End bp632002 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content64% 
IMG OID641561061 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001682210 
Protein GI167644547 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATACCCT GGTCGCTAAC GCTGGATAAG ATTTCGGACC ATGCCGCCCG CTGGCATGGC 
CGGGTTGAAA TCATATCCCG CCGGGCTGAC GGGTTGAGCC GACGCACCAA TTGGGCGGAG
CTGCGCGATG TCGCCCAGCA GGTTACCGGC GCGCTGGCAG CGCAGGGCGT TGTCTTGGGC
GATCGGGTCG GCACGCTGGC GATGAACAGC GATCGTCATC TCGCCGCCTG GTTTGGAATC
ATGAACATGG GCGCCGTCTG CCATACTCTC AATCCTCGTC TGTCCGACGA GCAATTGGCT
TATGTGATCA ACCATGCCGG CGATCGCCTG ATCCTGGCCG ACCGGCACTT TGGCGAGGCG
GTGGAACGTC TGCGCCCGCA CTGTCCTGCA GTTGAGCGGG TGGTGTGGCT CGACGACAAT
GGGCCGGACG GCTGGGAGGC GTGGCTGGAA GGGCGGTCGC AGGATTGCTC TTGGGGCGGA
TTCCCCGAGG AGTCGCCGGC CGGCCTATGC TACACCTCGG GCACGACCGG ACGACCCAAG
GGAGTGACCT ATACCCATCG GTCAAACTAC CTCCATACGC TGATGATCAT GCAGCCGGAC
GTGTTCAGTT TCAGTGCGCG GACCAACCTG TTGCTTGCCG TGCCCATGTT CCACGCCAAT
GCATGGGGGA TGTGCTTTGC CGCTGCGGCC GCCGGGTCAA AGCTTGTGTT GCCAGGACCC
AAGCTGGATG GCGCCAGCCT GTACGAGCTG TTGGAGGAGG AAGGCGTCAC TCTGACCGCG
GGCGTGCCCA CGGTGTGGCA GACGCTGCTG CAGTACCTGG GCGACAACAA GCTCCGGCTG
TCGGCGCTTG AGCGGGTGAT GATTGGCGGC GCGCATTGCC CCGAGGCGAT GATCCACGCC
TTCGCTGACC ATGGCGTTGA GGTACAGTGT AACTGGGGGA TGACTGAAAC CTCCCCCCTC
GGCGCCGCTG GCGCCCCGAC CGCGGAAATC GCGAAGCTGG ACCGGGATGC GCAGGTCAAG
AACAAGCTGA CCCAGGGTCG CGTGCCCTTG GGCGTCGACA TCGCAATCTT CGACGCCGAC
AGAAACGAGT TGCCGCGCGA CGGCCAGCAC ATCGGCTTTT TGGGTGTGCG TGGCCACTCG
GTGCTGGAGC GGTATTTTGC AAGCGATGAG ACGGCGCTGG ACTCACAGGG TTTCTTCGAT
ACCGGCGACA TTGGCGCCAT AGACGCCGCA GGCTATCTGC GGCTCACCGA TCGCGCCAAG
GATGCGATCA AGTCCGGCGG GGAGTGGATC AGTTCGTCCG AGATCGAGAA TGTAGCGCTC
AACCATCCAA GCGTTGCCGC CGCCGCCGCC CTGGCGGTGC CGCATCCCAA ATGGGGCGAG
CGCCCGCTCT TGATCGTGCA GCCCAAGAGC GGCGACAATC TGGACGCGGC GGGAATCCGC
GCCCTGCTTG AGCGAGACCT CGCCAAATGG GCGGTGCCTG ATGAAATCCG GTTTTGCGAC
ACGATCCCGG TCAACGGCAC GGGCAAGATC GACAAGGTCG CACTGCGCCG CCAGATCTTT
GGCTAG
 
Protein sequence
MIPWSLTLDK ISDHAARWHG RVEIISRRAD GLSRRTNWAE LRDVAQQVTG ALAAQGVVLG 
DRVGTLAMNS DRHLAAWFGI MNMGAVCHTL NPRLSDEQLA YVINHAGDRL ILADRHFGEA
VERLRPHCPA VERVVWLDDN GPDGWEAWLE GRSQDCSWGG FPEESPAGLC YTSGTTGRPK
GVTYTHRSNY LHTLMIMQPD VFSFSARTNL LLAVPMFHAN AWGMCFAAAA AGSKLVLPGP
KLDGASLYEL LEEEGVTLTA GVPTVWQTLL QYLGDNKLRL SALERVMIGG AHCPEAMIHA
FADHGVEVQC NWGMTETSPL GAAGAPTAEI AKLDRDAQVK NKLTQGRVPL GVDIAIFDAD
RNELPRDGQH IGFLGVRGHS VLERYFASDE TALDSQGFFD TGDIGAIDAA GYLRLTDRAK
DAIKSGGEWI SSSEIENVAL NHPSVAAAAA LAVPHPKWGE RPLLIVQPKS GDNLDAAGIR
ALLERDLAKW AVPDEIRFCD TIPVNGTGKI DKVALRRQIF G