Gene Caul_4813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4813 
Symbol 
ID5902275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5208026 
End bp5209396 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content68% 
IMG OID641565333 
ProductHlyD family type I secretion membrane fusion protein 
Protein accessionYP_001686431 
Protein GI167648768 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01843] type I secretion membrane fusion protein, HlyD family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.474577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGA CCGAACAAAC CGTCCGGGCC GGCCTGCCGG CCCGTTGGGA CCAGGACGAG 
GTCACCAGCG ACAATCCCGG TCACGAGATC AAGGCCGGGG CGCTGATCGC GCTGGCGTTC
TTTGGTGTCT TCCTCGGCTG GTCGTTCATC GCGCACCTGG ACGCCGCCGC GGTCGCCCAA
GGGTCGATCT CGGTCGCCGG CCACCGGCAG ACCGTGCAGC ACAAGGACGG CGGCATCGTT
TCGGCGATCT ACGTCAAGGA AGGCCAGCAT GTGAAGGCGG GCCAAGTGCT GATCGATCTG
GCTCCCGCCG ACGTCGAGGC GCTGGAACGG TCGATGGCCG CCCAGGTCAT CGGCCTGCAA
GCCCAGCGCG CCCGTCTCTA TGCGGAGCGT CTCGGTCTTT CGTCGATGGA GGCGCCGGCC
GAGTTCGCCA GCCTGACCGG CTACGACAAG GAGGAAGCCG ATCGCGCCCT GAAGATGCAG
CAGATCGAGC TGATGGCCGG CCGCCGCGCG GTGGGTGGAC AGAAGGCCGT TCTGGCCCAG
CGCTCGGCGC AGATCTCGCG CCAGATCGAA GGCTTCACCC AGCAGGCGCG CTCGACCGAC
GACCAGTCGC GGTTGATCAA CGACGAACTG CAGGGCACTC GCGACCTGGC CGGCAAGGGC
TATGCCTCGG TCAATCGGGT CCGCGCCCTG GAACGCACGG CCGCCGGCCT GGCTGGCTCG
CGCGCCGAAC TCGACGCCAA CGCCGCCCGG GCTCGCGAGC AGATCGGCGA AACCCGCATG
CAGGCCGCCA GCCTCGACAG CGACCGGGCC GAGCAAGTGG CCAAGGAAAT GCGCGACGTC
GAATTCCAGC TGAACGACCT GCTGCCCAAG CTGCGCGCGC TCAAGGAACA GCTGGCGGGA
ACCTCGATAC GCGCGCCGGC CACCGGCCAG GTGGTGGGCC TGACGATCTT CACGGTCGGC
GGCGTCATCG CGCCCGGCCA GCACCTGCTG GACATTGTGC CGGACATGGC TCCGCTGGTC
ATTGAGGCCC AGGTCAACCC GTCCGACGCC GGCGACCTCT ATGTCGGCCA GGAGACCGAG
GTGAAGATCG CCTCGCTGCA CGACCGTCAA ATCCCGATCC TCAAGGGCGA ACTGACGCGG
GTTTCGGCCG ACAGCTTCAC CGACGAGAAG AGCGGCGCGC GATACTTCAC CGCCGAGGTG
ACCGTGCCCG TCAGCCAGCT CCAGACCCTG AGCACCAAGA CCGGGGCCGT CTACAAGCTG
AAGCCGGGCC TGCCGGTGCA GATCATGGTG CCGCTGCGCA AGCGCACGGC GTTCCAGTAT
CTGACCGAAC CGCTGACCTC GGCCCTCTGG CGGTCGTTCC GCGAGCACTA G
 
Protein sequence
MNLTEQTVRA GLPARWDQDE VTSDNPGHEI KAGALIALAF FGVFLGWSFI AHLDAAAVAQ 
GSISVAGHRQ TVQHKDGGIV SAIYVKEGQH VKAGQVLIDL APADVEALER SMAAQVIGLQ
AQRARLYAER LGLSSMEAPA EFASLTGYDK EEADRALKMQ QIELMAGRRA VGGQKAVLAQ
RSAQISRQIE GFTQQARSTD DQSRLINDEL QGTRDLAGKG YASVNRVRAL ERTAAGLAGS
RAELDANAAR AREQIGETRM QAASLDSDRA EQVAKEMRDV EFQLNDLLPK LRALKEQLAG
TSIRAPATGQ VVGLTIFTVG GVIAPGQHLL DIVPDMAPLV IEAQVNPSDA GDLYVGQETE
VKIASLHDRQ IPILKGELTR VSADSFTDEK SGARYFTAEV TVPVSQLQTL STKTGAVYKL
KPGLPVQIMV PLRKRTAFQY LTEPLTSALW RSFREH