Gene ECH_1068 details


Gene Information

Locus tag: ECH_1068
Symbol:
ID: 3927971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1096827
End bp: 1098047
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 29%
IMG OID: 637902182
Product: C-type cytochrome family protein
Protein accession: YP_507853
Protein GI: 88657827
COG category: [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4232] Thiol:disulfide interchange protein
TIGRFAM ID:
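
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length. Both are assumptions about how the record was produced, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1096827
end_bp = 1098047

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1221 bp) and Protein Length (406 aa).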


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.876308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTTTTA ATGTTAATTC AATATACATT TTGATATGTG CTTTACTTGG CGGCATAATT 
TTAAATTGTA TGCCATGTGT ATTCCCTATC TTATCATTAA AAATAATGTC AATGGTTAAG
AATGCTCAAA AGAGTAAAAT ATTGATAAGA GCTGATGGGG TATTATATAC ACTTGGAGTT
ATGGTGAGTA TGTTTCTACT GTCTTCAATA CTTTTAATAT TACGTCATTT TGGCTATTTA
GTAAGTTGGG GATATCAAAT GCAATTTCCA ATATTAATTG CATTGTTGAT GTATATAATG
TTTTTGATGG GATTATCTTT TTCTGGATTT TATGATTTGC CTTTTATTGT TCCTAATTTT
AATAATGTGA ATTCTAAAAG AGAAGGATTA ATAGGTAGTT TTATTGTGGG TATGTTATCA
ACGTTTGTTG CTACCCCGTG TACAGCCCCT TTTATGGTAT CTGCTGTAAC TGTAGCTTTA
AATCAATCTA ACCTTTACTC AGTGTTGATT TTTCAAGTTC TTGGTTTTGG AATTGCGTTG
CCTTATTTAT TATTGTCATT TTTTCCAGGT TTATTGAAAA TTATTCCTAA GCCAGGTAGA
TGGATGGAAG TTTTACAAAG ATTTTTAGCT TTCCCACTTT ATTTTTCTTC AGCGTGGTTA
CTCTGTATTT TAATCAAACA GAAAGGTCCG GAAATATTAT TTGCTGTGTT ATCTTGTGCT
ATATTATTTG TCATGGGGAT ATGGATTATG AAATTTATAA AATCTTGGGA ACCTATAAGT
AAGTTTGTAA TTTGCTTGTG TTTATTGTTT ATAGCAATTT CTCCTTTATG TTTTGAGCCA
ATAAAAGAGT TTCTAATGAA ACATAAGGAA GCAAAACATG TTGTAGTAAT GGAATTTTCT
CAAAAGAAGT TAGAACAACT ACTTGAAGCG AAAGAGACAG TATTGTTGTC TGTAAGTGCA
GATTGGTGTT TAACTTGTAA AGTTAATGAA AAAATTTTAC AGTTGGACAC TGTACAGGCT
TTGCTTATGA AGAAAAAAAT TTACTACATG AGAGGTGATT TAACTTCTAA AAATTATGAA
TTAACAGAGT ATATTAATCA GTTGGATAAA AATAGTGTAC CGCTTTATGT ATTGTATGTT
GATGGAGTTA AGATTAAAGT TTTACCACAA GTCCTCAGTG AAAAGATAGT AATTGATATT
ATAAACAAAT ATGTAAAATA G
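
A GC content figure such as the one in the Gene Information fields is usually the share of G and C bases in the nucleotide sequence. The helper below is a minimal sketch of that calculation for a sequence laid out in space-separated blocks like the one above. The function name `gc_percent` is hypothetical, and the database may round or define the value differently.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first two 10-base blocks of the gene sequence.
print(gc_percent("ATGTTTTTTA ATGTTAATTC"))
```

Passing the full gene sequence to the same function gives a value that can be compared against the listed GC content.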
 
Protein sequence
MFFNVNSIYI LICALLGGII LNCMPCVFPI LSLKIMSMVK NAQKSKILIR ADGVLYTLGV 
MVSMFLLSSI LLILRHFGYL VSWGYQMQFP ILIALLMYIM FLMGLSFSGF YDLPFIVPNF
NNVNSKREGL IGSFIVGMLS TFVATPCTAP FMVSAVTVAL NQSNLYSVLI FQVLGFGIAL
PYLLLSFFPG LLKIIPKPGR WMEVLQRFLA FPLYFSSAWL LCILIKQKGP EILFAVLSCA
ILFVMGIWIM KFIKSWEPIS KFVICLCLLF IAISPLCFEP IKEFLMKHKE AKHVVVMEFS
QKKLEQLLEA KETVLLSVSA DWCLTCKVNE KILQLDTVQA LLMKKKIYYM RGDLTSKNYE
LTEYINQLDK NSVPLYVLYV DGVKIKVLPQ VLSEKIVIDI INKYVK
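
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below builds a codon table in the usual TCAG ordering and translates a nucleotide string until the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. This gene starts with ATG, so the sketch does not handle alternative starts. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with unknown bases are rendered as 'X'.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE.get(bases[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence above.
print(translate("ATGTTTTTTA ATGTTAATTC AATATACATT"))  # MFFNVNSIYI
print(translate("ATAAACAAAT ATGTAAAATA G"))           # INKYVK
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence.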