Gene Caul_4095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4095 
Symbol 
ID5901557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4447567 
End bp4448787 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content69% 
IMG OID641564615 
Producthypothetical protein 
Protein accessionYP_001685717 
Protein GI167648054 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.591324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCC TATCGTCCCC CTCATCGGGC GCCGCTGTCG TGGCTGGAAC CGCTTGGGCG 
GAGGCCTTCG CGGAAGCGCT CGCCAGTCTC CGCGCCCAGG GCCAGCGCTC GGCCCTGGCG
CTTCTCGGCA TTCTGATCGG CTCGGCCTCG ATCGTGGCCA TGCTGACCAT CGGGCACATG
GCCCAGCGCG AGACGCTGAA GCTGTTCTCC CACCTCGGCG TCGATATCGT GCAGATCCGC
GCCAATCCCA TCGGTCCGAA GCCCGCCGGC ATGGACCAGG AGGCGATCGA GCGGCTGCCT
CGCTCCGACC CCGACGTCCT GGCGGCCCTT CCCCTGTCGA TCGATCGCGC CAAGGTCGCC
GTGGGAGGAG TCAGTCAGGA CGCGGGCCTG ATCGCGGTGA CCGCCGATCT TCCCCGTCTG
GTTCGGCTTT CGATGAAAGG CGGTCGACTG TTCGGGCAAG CCGACCAGGG TAGTCTCGTG
GCCGTGCTGG GCTCCGAGAC GGCGGCGGCC CTGTCGACCG CTGGCGCTCC GGTCCGGGCG
GGCAGCCAGA TCCGCGTGCG AGACTATGTG TTCACGGTGA TCGGCGTGCT CGACCCTGTG
GCGTTCACCG CGATGGATCC GGTCGACTAC AACACCGCCG TGATCGTTCC GCTGAGCGAC
GCGCCCAGGA TCATGGCCTC GCCGGAGCCG AGTGTCGCGC TGCTGCGGCT GAGGCCGGGC
GCCGACATCG CGGCGACGGG CCAACGCCTG ACGGCTCGGC TGACCCGACC CGACTCTGCC
CTGCAGGTGC TCAGCGCCCA GGAGCAGATC AAGAATCTCA ACGCCCAGAA GGCCATCCAC
GGCCGGTTGC TGACGGCGAT CGGCGCGATC TCGCTGCTGG TCGGCGGCAT CGGGGTGATG
AACGTGATGC TGATGGGCGT GATGGAGCGG CGGCGGGAGA TCGGCCTGCG GGCCGCGCTG
GGCGCGACGC CGAGGGACCT GCGGATCATG TTCCTGGTCG AAGCCGCCGT CCTGACCTTC
GTGGGCGGGC TGGTGGGCCT GGTGTTCGGA CTTCTGGCGG CCTTCGCGGC GGCCCGAGCT
TCGGGGTGGA CCTTCAGCCT GGCGCTCTAT GTGCTGCCGC TCGGGCCGGG CATCGCGGCC
TTGGTCGGGA TCACGTTCGG GCTCTATCCC GCGATCAAGG CCTCCCGGCT GGATCCTATC
GAAGCCCTCA GGACGGAATG A
 
Protein sequence
MTALSSPSSG AAVVAGTAWA EAFAEALASL RAQGQRSALA LLGILIGSAS IVAMLTIGHM 
AQRETLKLFS HLGVDIVQIR ANPIGPKPAG MDQEAIERLP RSDPDVLAAL PLSIDRAKVA
VGGVSQDAGL IAVTADLPRL VRLSMKGGRL FGQADQGSLV AVLGSETAAA LSTAGAPVRA
GSQIRVRDYV FTVIGVLDPV AFTAMDPVDY NTAVIVPLSD APRIMASPEP SVALLRLRPG
ADIAATGQRL TARLTRPDSA LQVLSAQEQI KNLNAQKAIH GRLLTAIGAI SLLVGGIGVM
NVMLMGVMER RREIGLRAAL GATPRDLRIM FLVEAAVLTF VGGLVGLVFG LLAAFAAARA
SGWTFSLALY VLPLGPGIAA LVGITFGLYP AIKASRLDPI EALRTE