Gene Caul_5261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5261 
Symbol 
ID5897257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp195364 
End bp197010 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content68% 
IMG OID641555364 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001676695 
Protein GI167621910 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000293821 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGCGC GCGATCGGGA CGACACGGGA ATCCTGCTGG GCGGTATGCA AGCCTGGGGT 
CTGACGATTG ACAGGGTCCT GGCTCATGCC GAGGCGGTTC ACCCCCAACG GTCGGTGGTC
ACCCGCACGG CTGAAGGCGA ACTGCGGTCC ACCGACTATG CCGGCGTGGC CGCCCAGGCG
CGCGCCCTGG CCCGGTCGCT GGCTCGGGTG GGCGTGCGGC GCGGCGATCG GGTGGCGATG
ATCGCCTGGA CCGGCGATCG CCACATGGCG CTGTGGTACG CGGTCTCGGC CTACGGCGCG
GTAAGCCATC CGATCAATCC CCGCTTCTCG CCCGACCAGA TCGCCTGGAT CGTCGGCCAT
GCCGGGGACC GCTTGATGTT CCTGGACAGC ACCTTCGTGC CCCTGGTCGA GGCGCTGCAG
GACCGCCTGC CGGGGATTGA AAGGTTCGTG CTGTTGGCCG ACGAGGTCGA CATGCCCGCC
ACCGGCCTGC GTGGGGCCAT CAGTTACGAG GCCTTCCTGG CCCTGGGAGA GGGCGAGGCG
GACCTGGCGC CCGGGGGGTT CGACGAGAAC GCCGCCTGCG CGCTGTTCTA CACCTCCGGC
ACCACCGGCG ATCCCAAAGG CGTGCTCTAT TCTCACCGCT CCAACGTCCT GCACGCCATG
ATGCTGTCGC CCGCGTTGAA TCTGACGAGC CATGACGTGA TGATGCCCGT TGTGCCGATG
TTCCACGCCA ACGGCTGGGG TCTGCCCTAT GCCTGCCCCA TGGTCGGGGC GGCGATGGTC
ATGCCGGGCG CAGCCCTGGA TCCGGCCTCG CTTCACGCCC TGATGGAGGC GCAAGGCGTG
ACCATCACCG CCGGCGTGCC GACGCTCTGG CAGTCGCTGC TGCAGCATAT GAAGGACACC
GGCGCCCGGT TTTCGACCCT GCGCACCATC TTGGTAGCCG GCTCGGCCGC GCCGCGGGCT
TTGTTGACCG AGTATCGCGA GCGGTTCGGT GTCGAGGTGC GCCATCTCTG GGGGATGACC
GAGACAAGCC CCTGCGGCAC GGCCAACCCG CTCCCGCCGC AGGGGCAGGA CCATGATGTC
GAGGCGGCGG TGCGCGGCGA ATTGCGCCAG GGGCGCAATC CCTTCGGCCT GGAGATGCGG
GTCGCCAACG AGGCGGGCGC GTGGTTACCC CACGACGGCC GCTCGGCCGG CCGCCTGATG
GTGCGCGGCG CCGCCGTTGT CGAGCGCTAC TTTCGGGGTG AGCGTCCGGC CATCGACGCC
GAGGGCTGGT TCGACACCGG CGACGTGGCC ACTATCCATC CCGATCACGT CATGCAGATC
ACCGACCGGG CCAAAGACTT GATCAAGTCC GGCGGCGAAT GGATTAGTTC AATCGCCATC
GAGGACGCCG CGGCGCTGCA TCCAGCCACC GCCCTTTGCG CGGTCATCGC CATGCCGCAC
GCGAAGTGGG GCGAGCGCCC CCTATTGGCT GTCAAGCTCA AGTCGGGCGC AAGCGGGCAG
GCGGCCGACT ATCTGACCTT CCTGGAGGGC AAGATCGCCA AATGGTGGAT GCCCGACGAG
GTGGTGTTTA TTGAGGACAT GCCCCTGGGC GCCACCGGAA AGGTCGACAA GAAGGCCCTG
CGGGCGCGGC TGGTTCCCCA GGGCTGA
 
Protein sequence
MAARDRDDTG ILLGGMQAWG LTIDRVLAHA EAVHPQRSVV TRTAEGELRS TDYAGVAAQA 
RALARSLARV GVRRGDRVAM IAWTGDRHMA LWYAVSAYGA VSHPINPRFS PDQIAWIVGH
AGDRLMFLDS TFVPLVEALQ DRLPGIERFV LLADEVDMPA TGLRGAISYE AFLALGEGEA
DLAPGGFDEN AACALFYTSG TTGDPKGVLY SHRSNVLHAM MLSPALNLTS HDVMMPVVPM
FHANGWGLPY ACPMVGAAMV MPGAALDPAS LHALMEAQGV TITAGVPTLW QSLLQHMKDT
GARFSTLRTI LVAGSAAPRA LLTEYRERFG VEVRHLWGMT ETSPCGTANP LPPQGQDHDV
EAAVRGELRQ GRNPFGLEMR VANEAGAWLP HDGRSAGRLM VRGAAVVERY FRGERPAIDA
EGWFDTGDVA TIHPDHVMQI TDRAKDLIKS GGEWISSIAI EDAAALHPAT ALCAVIAMPH
AKWGERPLLA VKLKSGASGQ AADYLTFLEG KIAKWWMPDE VVFIEDMPLG ATGKVDKKAL
RARLVPQG