Gene ECH_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0042 
SymbolvirB10 
ID3927638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp39295 
End bp40638 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content33% 
IMG OID637901166 
Producttype IV secretion system protein VirB10 
Protein accessionYP_506874 
Protein GI88658039 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2948] Type IV secretory pathway, VirB10 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAG AAAATCAAAA CAACCAATAC ACAGAAATTG AAGAGAATGT AAACGTTGTT 
GGAGTAAATA AAGGAAAAAA ATTTATCACA ATAGGAGTGA TGTTAGTTGG ACTAGGGTTT
GCTTATTATT ATTTCTTCAC AGGCAAAAAA CCAGATGATA GTCTCAACAA ACCACAAACA
ACAGAAGAAG CAGACATAGA AAAATTATTA AAAGAATCTG TACCACCAAC CCAAGAAGTA
TCTCCACCTA TAAACATTCC ACCTCAATTA CCGGAGTTAC CACCTTTAGT ATCACCATCT
TTACCATCAA TTCCGACAGT TGAAAAACCT AAAGTACTAG AGATACCCAA AATACCAAAA
ATTAAACCAA AACAACCTGT TGAAATTAAA CCTAAACCGC AGCCCCAACC TGAACCTTTA
CCAAAAATAC CTTTACCCGT TCAAAATAAA ATAGACATAG CTGCTCCTAT TGCACCTATA
ACAACAGGAT ATGACAAAGA AAGAAGAGCT ACATCAATGT TAGCAGTTTC AGGAGGACAA
AATGTAGCAT CTGGCACTTC TGAAAACGGA GAGAGAGAAT CAAATATTGA TACTACAATT
AACAAGATGA ATTCCATTAT TTCTTTACAA ACTACATCTT CTCCTAATGT AGTAGCAACA
AAAGTGAGTA ATTTAGAGTT AACAATTTTA CAAGGAAAAA TAATTGATGT CGTCTTAGAA
ACTGCTATTA ATTCAGATTT ACAAGGTACA CTAAGAGGAA TAGTAGCAAG AGACGTCTAT
GCTGAAGCTA GCAATACAGT AATGATTCCA AAAGGATCTA GATTAATAGG AAGTTATTCT
TTTGATGCTA GCCCCGGAAA AACTAGAGTT CAAATATCTT GGAATAGAGT TATCCTCCCT
CATGGTATTG ACATAACACT TGACTCCAAC GGAACAGATG AATTAGGAAG ACAAGGAGCT
TCAGGTGTTG TTGATACAAA AATAGGCAAT ATATTAACTT CAACAATACT GTTAGCTGGT
GTATCTATAG CAACATCCTA TGCTACATCA AAAATCCCTG AGATTAACAA CTATCCTATT
TTAGAGTCAG ATAGCAAAGA AAAAAAGGAT AAGGAGAAAG ATGACACAGG TGATAAGTCT
AAATCTACAA AAACTACATT ACCTGTAAAA ATTTTATCTC AGGCAGTAGA TGATTTTTCA
AACTCTATAA AAGATATAAT AAAGAAATAT TCTAATAATA ATCCTACAGT ATACGTAGAC
CAAGGTACTT TGCTAAAAGT ATTTGTTAAT AAAGACATTG TGTTCCCAAA GTCAGCAGTT
CGTGGAATAG ACATTGTTAA TTAA
 
Protein sequence
MSEENQNNQY TEIEENVNVV GVNKGKKFIT IGVMLVGLGF AYYYFFTGKK PDDSLNKPQT 
TEEADIEKLL KESVPPTQEV SPPINIPPQL PELPPLVSPS LPSIPTVEKP KVLEIPKIPK
IKPKQPVEIK PKPQPQPEPL PKIPLPVQNK IDIAAPIAPI TTGYDKERRA TSMLAVSGGQ
NVASGTSENG ERESNIDTTI NKMNSIISLQ TTSSPNVVAT KVSNLELTIL QGKIIDVVLE
TAINSDLQGT LRGIVARDVY AEASNTVMIP KGSRLIGSYS FDASPGKTRV QISWNRVILP
HGIDITLDSN GTDELGRQGA SGVVDTKIGN ILTSTILLAG VSIATSYATS KIPEINNYPI
LESDSKEKKD KEKDDTGDKS KSTKTTLPVK ILSQAVDDFS NSIKDIIKKY SNNNPTVYVD
QGTLLKVFVN KDIVFPKSAV RGIDIVN