Gene Caul_4012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4012 
Symbol 
ID5901474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4343369 
End bp4344976 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content72% 
IMG OID641564533 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001685635 
Protein GI167647972 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0470647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGATAG AGCGTGGGTC GATGATCGAC GGTGAGATGC AGGCGTTCTC CCTGACGCTC 
GACAAGTTCC TGGACCACGC GGCCAAGTGG CGTCCCAACG CCCAGGTCGT GACCGCGCGG
GACGACGGCC GGATCGACCG CGTCGGCTAT GCCGACCTGA AGGCCCGCAG CCTGCGCGCC
TCGGCGGCGC TGGCCGGGAT GGGCGTTGGG AAAGGCCACC GCGTCGCCAC CCTGGCCTGG
AACACGCAGG ATCACGTCGA GGTCTGGTAC GCGATCATGG GCATGGGCGC GGTCTGCCAC
ACGCTCAATC CGCGCCTGAC CGCCGAGCAC CTGGCGGCGA TGATCGTCCA GTCGCAGGCG
CGCATCCTGA TCGCCTCGGC CGACCTCGCG GTCCTGGCCC GCCAGATCCT GGACGGCGCG
CCCGGCGTCG AACGGCTGCT GATCATCGAC GCCGGCGACG CTGCGGCGCC GGACGGGGAA
CTGCTGGAGC CCCTGGTCGC CGCCGCGCGC GGCGAGGTCG CCTGGGGGGC GTTCGACGAG
ACCGCGCCCA GCGGCCTCTG CTTCACCTCG GGCACCACGG GCGCGCCCAA GGGCGTCACC
TACACCCACC GCTCCAGCTT CCTGCACACG CTGCGGGTGC TGCAGGCCGA CGTGATGGCG
ATCTCGGGAA CCGACAGCAT CCTGGCCGTG GTGCCGATGT TCCACGCCAA CGCCTGGGGC
CTGCCGTTCG CCGCGCCGGC GGTGGGCGCC AAGCTCGTCC TGCCCGGTCG CCACGCCGAC
GGCGCCAGCC TGGCGCGCCT GATCGCCGCC GAGGGCGTGA CGGTGGGCGT CGGGGTGCCC
ACGGTCTGGC TGGGCCTGGT CGAGCACCTG GAGGCGACCG GGGGCGAGCT TCCCTCGCTG
AAACGCATCA TCGTCGGCGG CGCGCCCATG GCCCCGGCCC TGATGGAGCG GATCGAGCGG
CGGCTGGGCG TCACGGTCCA GACCAGCTGG GGCATGACCG AACTGTCGCC CTCCGGGACG
GTGGCCGCGC TGAGCGATCC GTCCCGCGCG AGCCTTTCCG GGCGGCCGGC CGTGGGGGTG
GACCTGCTGC TCACCGACGA AGCCGGCCAG CCGCTGCCCG ACCAGCGCGA CGGCGAGGGG
CATCTGCGGG TGCGCGGCGC GGCGGTGATC GAGCGCTATT TTGGTCACGA CGCCCCGGCG
ACCGACGCCG ACGGCTGGTT CCCGACCGGC GACCTGGCGC GCATCGACGC CGACGGCAAT
CTGACGATCA CCGGCCGGGC CAAGGACCTG ATCAAGTCCG GCGGGGAATG GATCAATCCC
GCCGAGATCG AGGCGGTGAT CGGGGCGCTG CCGGAAGTTT CCCTGGCCGC CGTGATCGGC
CGTCCCGACC CCAAGTGGGG CGAGCGGCCG ATCCTGCTGG TCGAGATGCG GGGGCCTGAC
GAGGCCGGCG GTGAAATAGG CGACGAGGCG CTGCTGGCCT CGCTGCGCGG CCGGGTGGCG
CCCTGGTGGG TCCCCGACGC GGTCTATCGC CTGGCCCGCA TGCCTTTGGC GTCCACGGGC
AAGATCGACA AGATTCGGCT GAGATCGGAA TACGGCGGCG AGGGGTGA
 
Protein sequence
MSIERGSMID GEMQAFSLTL DKFLDHAAKW RPNAQVVTAR DDGRIDRVGY ADLKARSLRA 
SAALAGMGVG KGHRVATLAW NTQDHVEVWY AIMGMGAVCH TLNPRLTAEH LAAMIVQSQA
RILIASADLA VLARQILDGA PGVERLLIID AGDAAAPDGE LLEPLVAAAR GEVAWGAFDE
TAPSGLCFTS GTTGAPKGVT YTHRSSFLHT LRVLQADVMA ISGTDSILAV VPMFHANAWG
LPFAAPAVGA KLVLPGRHAD GASLARLIAA EGVTVGVGVP TVWLGLVEHL EATGGELPSL
KRIIVGGAPM APALMERIER RLGVTVQTSW GMTELSPSGT VAALSDPSRA SLSGRPAVGV
DLLLTDEAGQ PLPDQRDGEG HLRVRGAAVI ERYFGHDAPA TDADGWFPTG DLARIDADGN
LTITGRAKDL IKSGGEWINP AEIEAVIGAL PEVSLAAVIG RPDPKWGERP ILLVEMRGPD
EAGGEIGDEA LLASLRGRVA PWWVPDAVYR LARMPLASTG KIDKIRLRSE YGGEG