Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0264 |
Symbol | |
ID | 3927956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 247080 |
End bp | 250067 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 637901388 |
Product | hypothetical protein |
Protein accession | YP_507085 |
Protein GI | 88658277 |
COG category | [S] Function unknown |
COG ID | [COG3164] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAA AAAAAAAGTT TATAAGATTA TTTTTTATAC TGATGATCTC TATTGTGTGT AGTGTATCCT ACCTGTATGT TCGTGATAGA GATGTTATAG AGATTAATGC AAAATTTTTG AATTTTTATG TAAAACATAA ACTTGCTACG GTTTTTCCTG GTTCTGAGAT AGATATAGGT AGTACTACGT TAACTTGGCA AAATAAGAGT GAAAATTTTA TATTATCATC ACAAAATCTT ACTATTAAAA GCCCTAAGCT TGGGGCAAAT ATTACAATAC CTAAGTTCCT GTTATATTCT AGAGTTGGGA TATTATTTTT GTGGGGAAAT TGTGATTTTT CTTATATTGA TATTCCTAAA ATCCGTATTG AGTTCTATGA CATTCAAAAA AAAGTTGGAA TAAATCTTGT AGAGAATTCT GTAAAAATTC TTAAAGATAT GCTGTTAAAG GCAATCAAAC TCGATATACC AGTATTTATT AGTAATGCTA TTCTTGCTAA ACATGGTAAG GAAGATCTAA TGATAAAACA ACTTAGTGTC AAAATGAATA AAGGGTATGA TAAAAATACT ATTGATTTTA ATGTCGAAAA TGACAATTCT TTCATCAGTA TGACAGTGCA TGAACACTAC AAAGGTATAA TGTCTTTTGA GATATCATAT GGTAATTTTA ATACTAAGGT ATTAGGATAT TTAGGTAATT TAAATGCAAA GCTTCCGCTG TATGATAATT TAAATTTTAG TGGTGATATT GTATTGAGAA TTGATGAACA TGATGAAATT ACATATGGTG ATGTAAATAT TCAAAGAATA AAAGGTGTTG TTCCTTGTGT TTCTTCTCGT AAATGCTACA TCCATGATTT TAGGACTAGA TTAGTATATC AAAATGATAT TCTATCAGTG AAAAATTTTT TTTTATTGTT AGATCAAAGT AAAATAGTTG CTACTGGAAT GATTGATAAC GAAAATGTGG ACTTGACTTT TAATTTTGAT GTTATATTTC CAAAGACGCT GTGTGATTAC TGGTTTGTTA ATTTATATCC TGATTTAAAT CGTTGGTATT GTACTAATGT GACAGACGGA AAAATTACTG ATCTGAAACT TCAAATTAAA GGAAAACGCG ATGGTATGTT CATCTATAAT GATCTTTCAA ACTATAATGT TAGTGCAAAT ATAGAAAATG TTTCGATAAA ATTCAACCAT AATTTTGATC CAGTGCGTAT AGTGCATGGT AAATTATATT TACGTGATAA TCAATTTATT ATTACTTCTG ATAATTCGGA TTTTAAAGGT GTAGTTATTC AAGATGGAAT ATTAAAAATA GATAATTTAA AAGATGAAAA TGCAGTGATG ACCATTAGTG GTAGTTCTGT GGGTGACGTA AAGCAATTAT ATAAAGCTGT AAATAAAGAT GAATTTATAC TGTTAGATAA TAACAAAATT TTTGGTAAGT CTCATACTAA CTTTGATTTT AAGATATTTA ATTTACTAAA TGACGATATT GTAGATTATG TATCTTATAT ACATGCACAA ATTGAATCTT TTAAAGCTGG TAGTATTTTG AATACTTTTG ATGTTTATGA TGCTAAGGTT GATGTTACAT TACATGATAA CGATGTGAAT ATAAATACTC AGGGTTATAT GAATGGCTTT CCTATGTCTT TGAAAGTGGA TAGAAATTTG CAAGATAATT ATAAATTTCA CTATGAATTT GCAGGGTATA TTTCAGCGGA CAATATAAAG GAGTTAGGAA TTTTTGAATA TGGAGATTGT TCAGGAGTAA TGAAGTCAAA CCTTCAATGG GATATTAATG CCAATAATAC AGTCATTACA GGGGATGTTG ATCTTTCTCA GTTAAAAATT CACTTTGATG GATTAACACA TAATAATTTT GATAGTGTAG TAAAGTTTTC TGCTTCATTT CAGGATAAGA AAGAAATTAA AATAGATAGT GCATCTATAG TTGGTAAAGG TGTAGATATA GAACTCAGTG GTAAAACGGG AGCCAATTTA GAATTATTAT TAAATAAGGT AAAATTGAAA GATACAGATA TTACAGCTGA GTTAAAGCGG AGTAATGATT CTATTACAGT AAAAGTATTC GGGGAATCTT TAGATTTAAG TACTGTTGAT TTTTCTGAGA TGATGAAAGG GGAATCTCCG ATGCAACAAA CTAAAATTGA TGTCAATGTA AGTCGCGTTA TGATGAAAAA TAATGTTGTA GCTAATAATG TAGATTTCAA ATTAAGTTGT TATGATACTG TTTGTAATGA AATAAAATTG ACAGGAAATT TTCTAGATAA TAGTAATTTT AGTTTAGAGT ATGGGCCTAT TGGATTAGAA ATAAATACTG ATAATGCAGG GGAACTTCTT CGTGCGATAG ATATTTTAAA AGTTGTAGAT AAAGGAAAAT TATCTTTCTA TATGTATCCG GTAAAAGCTG GTGAAATAAC TTCTGGTATG TTTTCATTAA CAAATTTTCA TCTTGTAAAT GCTTCCATTT TGGCTCAAAT ATTAACATTA TCCTCACTAA AAGGAGTAGT TAATACATTA AATGGTAAAG GAATATATTT CAATGCATTG AATGCACCAT TTACTTATCA AGATAACTTA ATAAGTATAG ATGAGTCTTG GATAGAGGGT TCTGAGTTAG GTATTAGTTT AGGTGGGGAA ATTGATCTAA ATACAAAGAT GTTTAACATA AAAGGACAAA TAATTCCTGC ATATGTGATT AATAAAATTA TATGGCAAAC TCCAATTATT GGAAAGTTGT TAACCGGAGG ACAAAGTCGT GGTGTTATCG CTATAGATTA CAAGGTAAAA GGTACAGATA AAGATCATGA TTTATCAGTC AATTTGATGT CTATTTTGAC ACCTAACTTA TTAAAAAGGG TATTAAAAAT ATTTGATAGT AAATTGTTAA AAAAAGAGCA TGTCAAGAAA AGATCATCTG TTAATAAGGT AAATAATCAG GTGAGAATGG TATCATAA
|
Protein sequence | MLKKKKFIRL FFILMISIVC SVSYLYVRDR DVIEINAKFL NFYVKHKLAT VFPGSEIDIG STTLTWQNKS ENFILSSQNL TIKSPKLGAN ITIPKFLLYS RVGILFLWGN CDFSYIDIPK IRIEFYDIQK KVGINLVENS VKILKDMLLK AIKLDIPVFI SNAILAKHGK EDLMIKQLSV KMNKGYDKNT IDFNVENDNS FISMTVHEHY KGIMSFEISY GNFNTKVLGY LGNLNAKLPL YDNLNFSGDI VLRIDEHDEI TYGDVNIQRI KGVVPCVSSR KCYIHDFRTR LVYQNDILSV KNFFLLLDQS KIVATGMIDN ENVDLTFNFD VIFPKTLCDY WFVNLYPDLN RWYCTNVTDG KITDLKLQIK GKRDGMFIYN DLSNYNVSAN IENVSIKFNH NFDPVRIVHG KLYLRDNQFI ITSDNSDFKG VVIQDGILKI DNLKDENAVM TISGSSVGDV KQLYKAVNKD EFILLDNNKI FGKSHTNFDF KIFNLLNDDI VDYVSYIHAQ IESFKAGSIL NTFDVYDAKV DVTLHDNDVN INTQGYMNGF PMSLKVDRNL QDNYKFHYEF AGYISADNIK ELGIFEYGDC SGVMKSNLQW DINANNTVIT GDVDLSQLKI HFDGLTHNNF DSVVKFSASF QDKKEIKIDS ASIVGKGVDI ELSGKTGANL ELLLNKVKLK DTDITAELKR SNDSITVKVF GESLDLSTVD FSEMMKGESP MQQTKIDVNV SRVMMKNNVV ANNVDFKLSC YDTVCNEIKL TGNFLDNSNF SLEYGPIGLE INTDNAGELL RAIDILKVVD KGKLSFYMYP VKAGEITSGM FSLTNFHLVN ASILAQILTL SSLKGVVNTL NGKGIYFNAL NAPFTYQDNL ISIDESWIEG SELGISLGGE IDLNTKMFNI KGQIIPAYVI NKIIWQTPII GKLLTGGQSR GVIAIDYKVK GTDKDHDLSV NLMSILTPNL LKRVLKIFDS KLLKKEHVKK RSSVNKVNNQ VRMVS
|
| |