Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1083 |
Symbol | |
ID | 3927854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1109879 |
End bp | 1111702 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 637902197 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_507868 |
Protein GI | 88658408 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.581758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCATT GTATAAAACA CACTTTTACA GTTTTAATAA TATATACATT ACTTTTGAAC ACTAATATAT GTTTTGCCCA TACGACAGAT CAAGATAATT TTTTTACTGT AAAACGCACT TCTTATACTA AAGAAGATTT TGTAAGCTTC ATAGTAAAAT GTCATGCTAA AGGAATACCT TTAGATTTCA AGAAGGAATT CGGCAATAAT TTATCTGGAG CAGATTTTAG TGATCTAGAC CTTCGAGGAT CAGTATTTGA TAACGTTAAC CTACTACATG CAAATTTTAC TAGAGCTAAC TTATCCAATT CCACTTTTAT AGATTCCAAT ATGCAAGGAG CATCCTTCAT TAATGCAAAC TTAAGTAAAT CCAACATAAA AAATTCAAAC TTAAATTTTG CCAATTTCAC ATCTGCAGAT TTACAAAAAA CTACAATTAC ACAATCAAAA ATCAACAATA CAAACTTTAG TTACTCAGAC ATGCGCTTTT CTATTCTAAC AGAAGTCAAT GGATCCCTTG CAAACTTTTC TGAAACAGAA TTAAAACTAG TTTCAATATT CAACTCTAAT ATAGAAAAAG CAAATTTTCA TGACATTGAA GGTGAAAATA TAGTAATCCA AAATTGTAAT TTACTAGATT CAAACTTTTT TGGAGCAAAT TTAAATAATT CAAAAATTCA ATTTTCAAAT CTCACAAATA CTATCCTATA CGCAGTAAAT CTAGATTCAT CAGATATAAC AGGAAGTAAC CTAAGTAATA GCAACATGGA AGTATCAAAT ATTTCTTTTG CTAATTTTCA CAATGCTAAT TTAAGCAATA CAAACATGCA TCTTGTAGAT GCTCATTATA CAAATCTTAA TAATACAAAT TTACATAAAG TTCGTATGAG TGATGCAACT CTAATTGGAG TAAATTTAAA TCATGCTGTG TTATCTAACG CAGATTTGAG CAACGCAGAT TTAAGCAATG TCAGCATAGA AGAATCAGAT ATGCAAGAAA TACTTCTTGA CTCTACTAAT ATTACAAGAA CGAATATGAA GCATGCTATT TTAAATAACG CAACTATTAA TAATACAGTT ATTTCAAATT CTGACCTTAC GATGACATTA ATAAAAGATT CTAAATGGAA TAACTCTAGT ATCTATTCAA GCAAATTAAC CTCTGCAGTT ATAGATAACA ACATTATTAA ACATGGTACA TATGATAAAG TAAATGCAAC AAATACATTA TGGTCTAATT CAACAATAGA AGATAGCAAC ATAATTTTCT GCAATTTTGA ATCTGCAATA TTCAATAATG ATAAAATACA TAAAGTGAGC TTTTTTACAA ATCACTTAGA ATCCGCAAAA GTAGAAGATT CAAACATAGT CGGATCTAGC ATTTACAAAA ATACTCTCAC CAATTTCATA ATTAATAATT GTGACATTAA AAATACAATC ATTATCAACA ATAAGAATTT TGTACCAACA GAATTTTCTA AACAGGCAAT TACATCTATT ACAGCTTTAC AAGACGCTAT CACAAACAAT AAAGTATTTG ATGTCAATTT TTCTAATTTT GATTTTAGAA AAATAAACTT AAACAATGCA AATTTTTCAA ACTCTATATT AAAGAATGCA AACTTTTCAG GATCACAATT AGAAAACGTA AATTTTACTA ACACAAATCT ACAATCTTCT CAATTTGAAA ACTCACAACT TAAAAATGTA GATTTTTCTA ATGCAAATTT AGAAAACGCG AACTTATCGA ACAGTAAATT AAATGATATC AACACATACA AAACAAATTT GCGTCACGTA AAAACTGACA ATACTCAGAA ATAA
|
Protein sequence | MKHCIKHTFT VLIIYTLLLN TNICFAHTTD QDNFFTVKRT SYTKEDFVSF IVKCHAKGIP LDFKKEFGNN LSGADFSDLD LRGSVFDNVN LLHANFTRAN LSNSTFIDSN MQGASFINAN LSKSNIKNSN LNFANFTSAD LQKTTITQSK INNTNFSYSD MRFSILTEVN GSLANFSETE LKLVSIFNSN IEKANFHDIE GENIVIQNCN LLDSNFFGAN LNNSKIQFSN LTNTILYAVN LDSSDITGSN LSNSNMEVSN ISFANFHNAN LSNTNMHLVD AHYTNLNNTN LHKVRMSDAT LIGVNLNHAV LSNADLSNAD LSNVSIEESD MQEILLDSTN ITRTNMKHAI LNNATINNTV ISNSDLTMTL IKDSKWNNSS IYSSKLTSAV IDNNIIKHGT YDKVNATNTL WSNSTIEDSN IIFCNFESAI FNNDKIHKVS FFTNHLESAK VEDSNIVGSS IYKNTLTNFI INNCDIKNTI IINNKNFVPT EFSKQAITSI TALQDAITNN KVFDVNFSNF DFRKINLNNA NFSNSILKNA NFSGSQLENV NFTNTNLQSS QFENSQLKNV DFSNANLENA NLSNSKLNDI NTYKTNLRHV KTDNTQK
|
| |