Gene Caul_5445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5445 
Symbol 
ID5897120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp158550 
End bp160184 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content59% 
IMG OID641550732 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001672218 
Protein GI167621710 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.513748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGTTTGA TGATGGACCG GCCGTTGATG ATATCGCAGC TTATCGAGCA TGCGGCTGAA 
TATCATGGCG ACAACGAGAT CGTGTCTAGG TCGGTCGAGG GCCCGATCCA TCGTCATACC
TATGCCGACG CGGCCCGGCG CTCACGCCAG CTCGCTAAAG CGCTTGAGGC GCTGGGCGTC
GCGCCGGGTG ACCCGATAGG AACCCTGGCG TGGAACGGCT ATCGGCATTT CGAGATTTAC
TTTGCGGTAT CCGGGATCGG CGCCATCTGC CATACGATTA ACCCGCGCCT CTTCCCAGAG
CAGATCGCTT ATATCATCAA CCATGCAAAT GATCGGTTCA TCTTCGCCGA CCTGAACGTA
CTTGCCATCC TTGAAGGGCT CGAGAAGAGT TTGGCAGGCG TGCGGGGAGT CATTGTGATG
ACTGATCGCG CTCACATGCC GGCCAGCAGC GCCCTACCCA ATATGCTGTG TTACGAGGAT
CTTGTCGCCG CTCAAGACGA GGCGTTTGAC TGGCCCGAGT TCGATGAGAA TAGCGCTGCG
TCGCTCTGCT ACACCTCCGG CACCACGGGC AACCCGAAGG GCGTTCTTTA CAGTCATCGT
TCGACCATCC TCCATGCTTA TGCCATCAAC AGCGCGAATG CTCTGGGCCT TACGGTCGAT
GACGCGATAC TCCCGGTCGT GCCGATGTTC CACGCCAATG CCTGGGGCAT CCCTTACGCG
GCTCCGATGG TCGGCGCCAA GCTTGTGTTA CCGGGCTTCA AGATGGATGG CGCCAGTCTG
TTCGAATTAT TCGATAGCGA GGATGTCACC GTAGCGGCCG GCGTGCCTAC GGTGTGGCAG
GAACTGTTGC GCTTTTGCGA AGCCGGCGGC CGATCCTTAG GCAAACTGCA GCGTACGCTG
ATCGGCGGCT CGGCGCCGCC GCGCGCCATG ATCGAACGTT TTGACAGGGA GCATGGGGTG
CGGGTCATGC AGGGCTGGGG CATGACAGAG ATGAGCCCAC TGGGCACGAT CACCTCGATG
CGGCGCGGCG AGCGCGATCT ACCGGCTGAG ACGCAATACG ACCTCATCGC CAAGCAGGGT
CGACCTATCT TCGGTGTCAG TTTGAAGATC GTCGATGACG CCGGTCGCGA GTTGCCCAAG
GACGGGGTTG CGTTCGGAAA CCTGCTGGTC CGAGGTCCTT GGGTCGCAAG GTCTTACCTT
CATGGCGAGG ACCCTTCAGC GTTCACCGAC GATGGCTGGT TTCATACTGG CGACGTCTGC
ACGATCGATC CGCACGGGTA TATGACGATC ACCGACCGCT CCAAGGACGT TATTAAATCA
GGCGGCGAGT GGATCAGTTC GATCGACCTG GAGAATGTCG CGATGGATCA TTCCGGGGTC
CAAGAGGCTG CAGTGATTGG CATCGTCCAT CCCCAGTGGG ACGAGCGACC GCTGCTCGTT
GTTGTGAGAC GATCTGAATC GAAGGTAACG CGCGAGGAAT TGCTGAACTC GTTCGAGGGG
AAGGTCGCAA AGTGGTGGAT TCCGGATGAT GTCGTCTTTG TCGATAGTTT GCCTCACACC
GCGACGGGAA AGTTGCTTAA GGCAAAGCTG CGGGAAGATT TTCGTGGATA CAAAGCGCCC
TCGACTGCTG GCTGA
 
Protein sequence
MGLMMDRPLM ISQLIEHAAE YHGDNEIVSR SVEGPIHRHT YADAARRSRQ LAKALEALGV 
APGDPIGTLA WNGYRHFEIY FAVSGIGAIC HTINPRLFPE QIAYIINHAN DRFIFADLNV
LAILEGLEKS LAGVRGVIVM TDRAHMPASS ALPNMLCYED LVAAQDEAFD WPEFDENSAA
SLCYTSGTTG NPKGVLYSHR STILHAYAIN SANALGLTVD DAILPVVPMF HANAWGIPYA
APMVGAKLVL PGFKMDGASL FELFDSEDVT VAAGVPTVWQ ELLRFCEAGG RSLGKLQRTL
IGGSAPPRAM IERFDREHGV RVMQGWGMTE MSPLGTITSM RRGERDLPAE TQYDLIAKQG
RPIFGVSLKI VDDAGRELPK DGVAFGNLLV RGPWVARSYL HGEDPSAFTD DGWFHTGDVC
TIDPHGYMTI TDRSKDVIKS GGEWISSIDL ENVAMDHSGV QEAAVIGIVH PQWDERPLLV
VVRRSESKVT REELLNSFEG KVAKWWIPDD VVFVDSLPHT ATGKLLKAKL REDFRGYKAP
STAG