Gene Caul_4604 details


Gene Information

Locus tag: Caul_4604
Symbol:
ID: 5902066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4979849
End bp: 4981327
Gene length: 1479 bp
Protein length: 492 aa
Translation table: 11
GC content: 67%
IMG OID: 641565123
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001686222
Protein GI: 167648559
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
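
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4979849
END_BP = 4981327


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1479 492
```

Both results match the "Gene length" and "Protein length" values listed in the record.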


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.37521
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCACG ATGGTGTCGC GCTAAGTTCC ATTCTAACCC ACCATGCGCG TCGATCCCCC 
TCTCGCACGG CGCTGATCGT CGATGGCGTT CGCGTCGCCT ATGACGAACT GGACGCGCGC
ACAAACCGTC GCGCCAGGAT GCTGGCGGCG CATGGCGTAG GCCATGGCGA CTTCGTCACG
GTCGCGCTTC CCAATGGCCT GGAATTCTAC GAGACCACCT TCGCGCTCTG GAAACTCGGG
GCGATCCCCA ACATCGTCGC CGCCAAGCTC CCGCGCCTCG AAATGGAGGC GATCCTCGAC
ATCGTTCGCC CCAGGCTGTT TGTCGGCGTC CCGCCCGGGG GCGACGTTCC GGCCCTGGCC
GAAGGTCAGG CGGAGTTGCA CCGATATTCG ACCGATCCGC TGCCGGAAGT TATCTCGCCG
CACTGGAAAG CGATGACGAG CGGCGGCTCT ACCGGCCGGC CGAAGGTGAT CGTGGACGCC
ATGCCGGCGC GGTGGAATCC GCAGGAGGGC TTCCTGGGCC AGCGTCCTGG CGACGTGATC
CTCAATCCCG GGCCGCTCTA TCACAACGCA CCGTTCCACT GCGTCCACAT GGGTCTGTTC
GTCGGCGCCA CGATCGTCGA GATGGGCAAG TTCGATGCGC TCGCCGCGCT CGAACTGATC
GACGCCCATC AGGTCAATTG GGTGACCATG GTGCCGACGA TGATGCACCG CGTGTGGCGC
CTGGACCCAG AGGTTCGGTC CCGCTTCACG CTGCCCAGTC TGCGCATGAT GCTCCACATG
GCCGCCCCCT GCCCCGCCTG GCTCAAGGAG GCCTGGATCG GCTGGCTGGG CGGCGAGCGG
GTGTGGGAAT ATTACGGCAC GACCGAAGGG ACGGGATCGA CGATGATTTC CGGCACGGAC
TGGTTGGCTC ATCCAGGTTC GGTGGGGCGC GTCCGTGAGG GCTATGCGCT GAAGATTCTC
GACGAGACGG GGCGGGAGCG ACCGATCGGC GAGGTCGGCG AGGTCTATTT CCGCCCAGAG
GGCGGCGCGG GATCGACCTA CCACTATCTG GGAAGCACGC CCCGGCGGGT CGGCGAATGG
GAGACGCCCG GGGACCTGGG GCATGTGGAC GAGGACGGCT ATCTCTATCT TTCCGACCGT
CGCAACGACC TGATCATCTC CGGCGGCGCG AACATCTACC CGGCCGAGGT CGAGGCCGCG
ATCGACGCGC ATCCGGCCGT TCGGACCAGC GCGGTGATCG GGCTTCCGGA CGAGGAGTGG
GGCGCGCGCG TCCATGCGAT CGTCCAGCCG ATCGAGGACT CAGGCCTGGA GGAGGCGGAG
CTTCTCGCGT TCGTCGCCGA CCGGCTGGCG CGCTTCAAGC TGCCCAAGAG CGTCGAGTTC
ACGCGTGACC CCTTGCGGGA CGAGGCTGGA AAGGTCCGTC GGACCGCGCT GCGCGACGCT
CGATTGGGCG GAGGGGCGGG GCAGGTTGTC CCAGCCTAG
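
The GC content reported for this gene can be recomputed from the sequence itself. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; `gc_content` is an illustrative helper name, and the short demo string is just the first two blocks of the gene sequence.

```python
def gc_content(dna):
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small demo.
demo = "ATGTCGCACG ATGGTGTCGC"
print(f"{gc_content(demo):.0%}")  # 60%
```

Passing the full 1479 bp gene sequence to the same function gives the gene-wide value to compare against the record.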
 
Protein sequence
MSHDGVALSS ILTHHARRSP SRTALIVDGV RVAYDELDAR TNRRARMLAA HGVGHGDFVT 
VALPNGLEFY ETTFALWKLG AIPNIVAAKL PRLEMEAILD IVRPRLFVGV PPGGDVPALA
EGQAELHRYS TDPLPEVISP HWKAMTSGGS TGRPKVIVDA MPARWNPQEG FLGQRPGDVI
LNPGPLYHNA PFHCVHMGLF VGATIVEMGK FDALAALELI DAHQVNWVTM VPTMMHRVWR
LDPEVRSRFT LPSLRMMLHM AAPCPAWLKE AWIGWLGGER VWEYYGTTEG TGSTMISGTD
WLAHPGSVGR VREGYALKIL DETGRERPIG EVGEVYFRPE GGAGSTYHYL GSTPRRVGEW
ETPGDLGHVD EDGYLYLSDR RNDLIISGGA NIYPAEVEAA IDAHPAVRTS AVIGLPDEEW
GARVHAIVQP IEDSGLEEAE LLAFVADRLA RFKLPKSVEF TRDPLRDEAG KVRRTALRDA
RLGGGAGQVV PA
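
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons; this gene starts with ATG, so no special handling is included). `translate` is an illustrative helper, not a database or library function, and it raises `KeyError` on codons containing anything other than A, C, G, T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCGCACG ATGGTGTCGC GCTAAGTTCC"))  # MSHDGVALSS
```

The output matches the first ten residues of the protein sequence, and the final three codons of the gene (CCA GCC TAG) translate to the terminal "PA" followed by a stop.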