Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1041 |
Symbol | virB4-2 |
ID | 3927466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 1071112 |
End bp | 1073487 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637902155 |
Product | type IV secretion system protein VirB4 |
Protein accession | YP_507826 |
Protein GI | 88657785 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | [TIGR00929] type IV secretion/conjugal transfer ATPase, VirB4 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTTG TGAAAGAAAT GATCGGCCAT TCTTCTGATA TGAATAATTT TTCTAGAAAA AGACGAGATA ATACCTCTAG TAAAGGAGAT TTTATTCCTG CAGCTTGTCA TTATGATGAG AATACAATAT TGAATAAAGA TGGTGAACTT GTACAAATAA TAAAGATAGA GGATTATGTA CTTACTCATT ATGTTAATGA TAAAGATTTA AGAACAGTAG TGCGTAATAG TATAGTTAAT AGTGTTGAAG TTCCAGAAGT TTCTTTTTGG ATTTATACTG TAAGGAAACC ACATAAGTTT GATTTTGCAA GAAAAAGTAT AAACGATGTT TCTGATGCTT TAGGTAGTGC TCATCTTAAT AATATAGGTC AACGTGTGAC TTATATTAAT GAGTTATATA TAGCAGTGGT TACTAATCAC TTACCTGAAA GTATGAAAGG AGTGTTAGGT GCTCTGTCAT TCTCCTATGT AAAAAATAAG CATAAAGATT TTTTAAAAAA TAAAATAGAC AGATTAAATA AGGTCACTGC AAGTATTTTA GAAAATTTGA AAAAGTTTGA AGTCAGAAAG TTAGGATTGA TAGTAATTGA TAAAGAAAAA GTAAGATCGG AGTTGATAGA GTTTTTGTAT TATTTAACTA TGATGCATCA TAAAGAATGT TTCCTTGATA TGGTAGATAT TTCTGGTATA TGTAGTCATT GTAGTATTAG TATGGGATTT AATACATTTA AAATTTCATG TGATAATAAC CAAAGATTTG GTGCAATATT AGCAATTAAA GATTATCAAG ATTCTCCATT AGATGCGGTA GATGAATGTC TACAGCAGGA TTATGGATTT ATCGTAGTTG AAATTATAAA GTTTGCAAAA AGTAAGAATG CATTAAAGCT TTTTCAAAAG CAAGCTACAT TTTTAGAATG TAGCAATGAT TTTCAGTTGA GAAAGTTATC CAATATAGAT GATTTCGTAT CAGTTGATCC AAACTCTAAT TTAAGTTTCT GCGAACGCAA AATAAACTTT GTAATAATGT CAGATACTTT ACCACAGCTT CATAATAATA TAGATAGGGC TGTTAATTCA TTGTCTTCAC TTGGTATTAT TTGTGTCAGG TGTGATTTGA GCATGGAAGA TGATTTTTGG GCACATTTAC CTGGCAATTT TTCTTATATT TTAAACTTTA GGTATACGTT AATAAAATAT GCTTGTGCAT TTTCGTTGTT GCATTATTTC CCTTCAGGAG CACTTCAAGG AAACAAGTGG GGACAAGCAA TTACTATGTT TTTTTCAAAT AAGGGTAAAC CTTATTTCTT TAGTTTTCAT GTGTTTGATA AAGGACACAC TTTAATGGTT GGTAGTCCTC AATCTTCAGT TACTATGTTA CTGAACTTTT TATTGTCAGA ATCTATGCAG TTGAATGCAC GAATTGTCTT GCTAGATTAT ACTGGTAAGT CTATTGTTTT TGTTAAGGCT ATGGGTGGTC AGTATTATAG AGCAGACCAT AGGCGTGATT ATCAGGAAAT GTCGTTTAAT TTCTTTCAAG TTGAAGATAC TGCACTTAAT CGTAGAATCG TTACTGGTGT TTTGCAAAGA ATGTTGAATG TTAAAAACAT AACTGAGGAA GTTAATAGTG CAATAGATAG AATAGTTAAT GATCTTTTTA CGTTGCCTCT TGAGTCTAGA ACTATAAATA GTATTGCTGA CCATGTCAGT ACACTGGGTA CCAATGCTAG TCAATGGTTA AATAATGGGG AGTTTGCGCA TTTACTAAAG GAAGATGCTA ATATTGATTG GGCAGCAAAA GTTTTAGGGT TGAATATTGG TATTTTGTTC TCTAAACCTC AATGTGCTTC TGTTATTGTT TATTACTTTT TGCATGCTTT AATTAATTAT CTTGATGGGT CTCCTACGGT TTTAGTAATA GATGAAGCAT GGATTTTGGA TTATGTTTTT ACTAGTGATC AGGAATTTGA TGAGTGGATT GAAATGATGA ACAAATTAAA TGTTGTTGTT GTATTTGCTG GTGAAAACAT TCCAGCTATT ATTTCCAGTA ATATCATTTG TAGGTTTAAT CAGCATGTTG AAACACAAGT TTTCATGCCA AATTCAGTAT CAACTAATAA AATGTATATG AGGGCATTTA ATCTATCAAA GTCAGAATGT AATACTATGT TTCAAATGCC ATCTCAGGAA GGATATTTTT TCGTAAAGCA GGATAATGAT TCAGTAGTAT TGTCTTTCAA TTTGCCAAAT ATACCAGAAA CTAATGTTCT TTCTGCTAAT AAGAATACAA TTCGGTATAT GTATGAATCT ATTAGTAGTC ATGGGGATAA TGTAAGAGAA TGGCTGCCTG CATTTTATAA GAAATGTGGA GCTTAA
|
Protein sequence | MSFVKEMIGH SSDMNNFSRK RRDNTSSKGD FIPAACHYDE NTILNKDGEL VQIIKIEDYV LTHYVNDKDL RTVVRNSIVN SVEVPEVSFW IYTVRKPHKF DFARKSINDV SDALGSAHLN NIGQRVTYIN ELYIAVVTNH LPESMKGVLG ALSFSYVKNK HKDFLKNKID RLNKVTASIL ENLKKFEVRK LGLIVIDKEK VRSELIEFLY YLTMMHHKEC FLDMVDISGI CSHCSISMGF NTFKISCDNN QRFGAILAIK DYQDSPLDAV DECLQQDYGF IVVEIIKFAK SKNALKLFQK QATFLECSND FQLRKLSNID DFVSVDPNSN LSFCERKINF VIMSDTLPQL HNNIDRAVNS LSSLGIICVR CDLSMEDDFW AHLPGNFSYI LNFRYTLIKY ACAFSLLHYF PSGALQGNKW GQAITMFFSN KGKPYFFSFH VFDKGHTLMV GSPQSSVTML LNFLLSESMQ LNARIVLLDY TGKSIVFVKA MGGQYYRADH RRDYQEMSFN FFQVEDTALN RRIVTGVLQR MLNVKNITEE VNSAIDRIVN DLFTLPLESR TINSIADHVS TLGTNASQWL NNGEFAHLLK EDANIDWAAK VLGLNIGILF SKPQCASVIV YYFLHALINY LDGSPTVLVI DEAWILDYVF TSDQEFDEWI EMMNKLNVVV VFAGENIPAI ISSNIICRFN QHVETQVFMP NSVSTNKMYM RAFNLSKSEC NTMFQMPSQE GYFFVKQDND SVVLSFNLPN IPETNVLSAN KNTIRYMYES ISSHGDNVRE WLPAFYKKCG A
|
| |