Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1068 |
Symbol | |
ID | 3927971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 1096827 |
End bp | 1098047 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 637902182 |
Product | C-type cytochrome family protein |
Protein accession | YP_507853 |
Protein GI | 88657827 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.876308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTTA ATGTTAATTC AATATACATT TTGATATGTG CTTTACTTGG CGGCATAATT TTAAATTGTA TGCCATGTGT ATTCCCTATC TTATCATTAA AAATAATGTC AATGGTTAAG AATGCTCAAA AGAGTAAAAT ATTGATAAGA GCTGATGGGG TATTATATAC ACTTGGAGTT ATGGTGAGTA TGTTTCTACT GTCTTCAATA CTTTTAATAT TACGTCATTT TGGCTATTTA GTAAGTTGGG GATATCAAAT GCAATTTCCA ATATTAATTG CATTGTTGAT GTATATAATG TTTTTGATGG GATTATCTTT TTCTGGATTT TATGATTTGC CTTTTATTGT TCCTAATTTT AATAATGTGA ATTCTAAAAG AGAAGGATTA ATAGGTAGTT TTATTGTGGG TATGTTATCA ACGTTTGTTG CTACCCCGTG TACAGCCCCT TTTATGGTAT CTGCTGTAAC TGTAGCTTTA AATCAATCTA ACCTTTACTC AGTGTTGATT TTTCAAGTTC TTGGTTTTGG AATTGCGTTG CCTTATTTAT TATTGTCATT TTTTCCAGGT TTATTGAAAA TTATTCCTAA GCCAGGTAGA TGGATGGAAG TTTTACAAAG ATTTTTAGCT TTCCCACTTT ATTTTTCTTC AGCGTGGTTA CTCTGTATTT TAATCAAACA GAAAGGTCCG GAAATATTAT TTGCTGTGTT ATCTTGTGCT ATATTATTTG TCATGGGGAT ATGGATTATG AAATTTATAA AATCTTGGGA ACCTATAAGT AAGTTTGTAA TTTGCTTGTG TTTATTGTTT ATAGCAATTT CTCCTTTATG TTTTGAGCCA ATAAAAGAGT TTCTAATGAA ACATAAGGAA GCAAAACATG TTGTAGTAAT GGAATTTTCT CAAAAGAAGT TAGAACAACT ACTTGAAGCG AAAGAGACAG TATTGTTGTC TGTAAGTGCA GATTGGTGTT TAACTTGTAA AGTTAATGAA AAAATTTTAC AGTTGGACAC TGTACAGGCT TTGCTTATGA AGAAAAAAAT TTACTACATG AGAGGTGATT TAACTTCTAA AAATTATGAA TTAACAGAGT ATATTAATCA GTTGGATAAA AATAGTGTAC CGCTTTATGT ATTGTATGTT GATGGAGTTA AGATTAAAGT TTTACCACAA GTCCTCAGTG AAAAGATAGT AATTGATATT ATAAACAAAT ATGTAAAATA G
|
Protein sequence | MFFNVNSIYI LICALLGGII LNCMPCVFPI LSLKIMSMVK NAQKSKILIR ADGVLYTLGV MVSMFLLSSI LLILRHFGYL VSWGYQMQFP ILIALLMYIM FLMGLSFSGF YDLPFIVPNF NNVNSKREGL IGSFIVGMLS TFVATPCTAP FMVSAVTVAL NQSNLYSVLI FQVLGFGIAL PYLLLSFFPG LLKIIPKPGR WMEVLQRFLA FPLYFSSAWL LCILIKQKGP EILFAVLSCA ILFVMGIWIM KFIKSWEPIS KFVICLCLLF IAISPLCFEP IKEFLMKHKE AKHVVVMEFS QKKLEQLLEA KETVLLSVSA DWCLTCKVNE KILQLDTVQA LLMKKKIYYM RGDLTSKNYE LTEYINQLDK NSVPLYVLYV DGVKIKVLPQ VLSEKIVIDI INKYVK
|
| |