Gene ECH_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_1041 
SymbolvirB4-2 
ID3927466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1071112 
End bp1073487 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content31% 
IMG OID637902155 
Producttype IV secretion system protein VirB4 
Protein accessionYP_507826 
Protein GI88657785 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID[TIGR00929] type IV secretion/conjugal transfer ATPase, VirB4 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTTG TGAAAGAAAT GATCGGCCAT TCTTCTGATA TGAATAATTT TTCTAGAAAA 
AGACGAGATA ATACCTCTAG TAAAGGAGAT TTTATTCCTG CAGCTTGTCA TTATGATGAG
AATACAATAT TGAATAAAGA TGGTGAACTT GTACAAATAA TAAAGATAGA GGATTATGTA
CTTACTCATT ATGTTAATGA TAAAGATTTA AGAACAGTAG TGCGTAATAG TATAGTTAAT
AGTGTTGAAG TTCCAGAAGT TTCTTTTTGG ATTTATACTG TAAGGAAACC ACATAAGTTT
GATTTTGCAA GAAAAAGTAT AAACGATGTT TCTGATGCTT TAGGTAGTGC TCATCTTAAT
AATATAGGTC AACGTGTGAC TTATATTAAT GAGTTATATA TAGCAGTGGT TACTAATCAC
TTACCTGAAA GTATGAAAGG AGTGTTAGGT GCTCTGTCAT TCTCCTATGT AAAAAATAAG
CATAAAGATT TTTTAAAAAA TAAAATAGAC AGATTAAATA AGGTCACTGC AAGTATTTTA
GAAAATTTGA AAAAGTTTGA AGTCAGAAAG TTAGGATTGA TAGTAATTGA TAAAGAAAAA
GTAAGATCGG AGTTGATAGA GTTTTTGTAT TATTTAACTA TGATGCATCA TAAAGAATGT
TTCCTTGATA TGGTAGATAT TTCTGGTATA TGTAGTCATT GTAGTATTAG TATGGGATTT
AATACATTTA AAATTTCATG TGATAATAAC CAAAGATTTG GTGCAATATT AGCAATTAAA
GATTATCAAG ATTCTCCATT AGATGCGGTA GATGAATGTC TACAGCAGGA TTATGGATTT
ATCGTAGTTG AAATTATAAA GTTTGCAAAA AGTAAGAATG CATTAAAGCT TTTTCAAAAG
CAAGCTACAT TTTTAGAATG TAGCAATGAT TTTCAGTTGA GAAAGTTATC CAATATAGAT
GATTTCGTAT CAGTTGATCC AAACTCTAAT TTAAGTTTCT GCGAACGCAA AATAAACTTT
GTAATAATGT CAGATACTTT ACCACAGCTT CATAATAATA TAGATAGGGC TGTTAATTCA
TTGTCTTCAC TTGGTATTAT TTGTGTCAGG TGTGATTTGA GCATGGAAGA TGATTTTTGG
GCACATTTAC CTGGCAATTT TTCTTATATT TTAAACTTTA GGTATACGTT AATAAAATAT
GCTTGTGCAT TTTCGTTGTT GCATTATTTC CCTTCAGGAG CACTTCAAGG AAACAAGTGG
GGACAAGCAA TTACTATGTT TTTTTCAAAT AAGGGTAAAC CTTATTTCTT TAGTTTTCAT
GTGTTTGATA AAGGACACAC TTTAATGGTT GGTAGTCCTC AATCTTCAGT TACTATGTTA
CTGAACTTTT TATTGTCAGA ATCTATGCAG TTGAATGCAC GAATTGTCTT GCTAGATTAT
ACTGGTAAGT CTATTGTTTT TGTTAAGGCT ATGGGTGGTC AGTATTATAG AGCAGACCAT
AGGCGTGATT ATCAGGAAAT GTCGTTTAAT TTCTTTCAAG TTGAAGATAC TGCACTTAAT
CGTAGAATCG TTACTGGTGT TTTGCAAAGA ATGTTGAATG TTAAAAACAT AACTGAGGAA
GTTAATAGTG CAATAGATAG AATAGTTAAT GATCTTTTTA CGTTGCCTCT TGAGTCTAGA
ACTATAAATA GTATTGCTGA CCATGTCAGT ACACTGGGTA CCAATGCTAG TCAATGGTTA
AATAATGGGG AGTTTGCGCA TTTACTAAAG GAAGATGCTA ATATTGATTG GGCAGCAAAA
GTTTTAGGGT TGAATATTGG TATTTTGTTC TCTAAACCTC AATGTGCTTC TGTTATTGTT
TATTACTTTT TGCATGCTTT AATTAATTAT CTTGATGGGT CTCCTACGGT TTTAGTAATA
GATGAAGCAT GGATTTTGGA TTATGTTTTT ACTAGTGATC AGGAATTTGA TGAGTGGATT
GAAATGATGA ACAAATTAAA TGTTGTTGTT GTATTTGCTG GTGAAAACAT TCCAGCTATT
ATTTCCAGTA ATATCATTTG TAGGTTTAAT CAGCATGTTG AAACACAAGT TTTCATGCCA
AATTCAGTAT CAACTAATAA AATGTATATG AGGGCATTTA ATCTATCAAA GTCAGAATGT
AATACTATGT TTCAAATGCC ATCTCAGGAA GGATATTTTT TCGTAAAGCA GGATAATGAT
TCAGTAGTAT TGTCTTTCAA TTTGCCAAAT ATACCAGAAA CTAATGTTCT TTCTGCTAAT
AAGAATACAA TTCGGTATAT GTATGAATCT ATTAGTAGTC ATGGGGATAA TGTAAGAGAA
TGGCTGCCTG CATTTTATAA GAAATGTGGA GCTTAA
 
Protein sequence
MSFVKEMIGH SSDMNNFSRK RRDNTSSKGD FIPAACHYDE NTILNKDGEL VQIIKIEDYV 
LTHYVNDKDL RTVVRNSIVN SVEVPEVSFW IYTVRKPHKF DFARKSINDV SDALGSAHLN
NIGQRVTYIN ELYIAVVTNH LPESMKGVLG ALSFSYVKNK HKDFLKNKID RLNKVTASIL
ENLKKFEVRK LGLIVIDKEK VRSELIEFLY YLTMMHHKEC FLDMVDISGI CSHCSISMGF
NTFKISCDNN QRFGAILAIK DYQDSPLDAV DECLQQDYGF IVVEIIKFAK SKNALKLFQK
QATFLECSND FQLRKLSNID DFVSVDPNSN LSFCERKINF VIMSDTLPQL HNNIDRAVNS
LSSLGIICVR CDLSMEDDFW AHLPGNFSYI LNFRYTLIKY ACAFSLLHYF PSGALQGNKW
GQAITMFFSN KGKPYFFSFH VFDKGHTLMV GSPQSSVTML LNFLLSESMQ LNARIVLLDY
TGKSIVFVKA MGGQYYRADH RRDYQEMSFN FFQVEDTALN RRIVTGVLQR MLNVKNITEE
VNSAIDRIVN DLFTLPLESR TINSIADHVS TLGTNASQWL NNGEFAHLLK EDANIDWAAK
VLGLNIGILF SKPQCASVIV YYFLHALINY LDGSPTVLVI DEAWILDYVF TSDQEFDEWI
EMMNKLNVVV VFAGENIPAI ISSNIICRFN QHVETQVFMP NSVSTNKMYM RAFNLSKSEC
NTMFQMPSQE GYFFVKQDND SVVLSFNLPN IPETNVLSAN KNTIRYMYES ISSHGDNVRE
WLPAFYKKCG A