Gene ECH_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0033 
Symbol 
ID3927815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp28890 
End bp30071 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content32% 
IMG OID637901158 
ProductHK97 family phage portal protein 
Protein accessionYP_506866 
Protein GI88658099 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTCT TTAAAAAAAA GTCAATACAA GACAATTCTT ACACATTCTC AGTTCCCATA 
CAACTATTTA CAGAGGCTGT ATGGAAAAAC AGAAGCTATG CAAATTTCGC AGAAAATGGC
TACATAAAAA ACGTAATTGC TTTTAGATCT ATTCACATGA TTGCATCAGC TGCAGCATCT
GTTTCTTTAT TACTAAATAA GACTATCAAA AACAACACAT TCCAAATAAA AAACCATCCT
TTATTAAAAT TAATATCTAA ACCAAACAAC ACCACCTCAA AATCAGAATT CATCGAAGGA
ATTCTTACTT ATAAACTTAT TAGCGGTAAT GCTTATATTT TGACAATAGA AAATCATGAT
ATGATTCCCA AAGAATTACA TCTTTTGCGG CCAGATAGAA TTGAGATTAC TCCAGGAAAA
GATAATAGGC CATATTCATA TCGCTATTCT ATAAATAATT ATCACTATGA CTATAAGATT
AATAAATCAA CAAATTATTC ACAAATCCTA CACATAAAAA ACTTTCATCC ACTTAATGAT
TATTATGGAT TATCCCCTAT AGAAGCAGCT TCATACAGTA TAGATCAACA TAACCAGGCT
GGATCTTGGA ACCAAGCAAT GCTACAAAAT GGAGCTAGAC CAAGCGGGGC ATTAATTGTA
AACGCAAAAA GTAACAACAA TGGTAATTTA ACACAAGAAC AATACACTCG TTTAAAATCA
CAAGTAGATG AATTCTATTC TGGTCCAAGA AATGCTGGAA GACCAATATT ACTTGAAGGG
GGATTAGATT GGAAAGAAAT GAGTTTATCC CCTAAAGATA TGGATTTTAT CGAATCAAAG
CATAGTTCAG CACGTGACAT TGCATTAGCA TTTGGTGTAC CACCCCAATT GCTTGGTATT
CCTGGAGATA ATACCTATAA TAATCTTATC GAAGCAAGGC TTTCACTATG GGAACAAACA
ATATTACCTC ACTTAGACAA TATTATTTCA CATTTTAACA ATTGGTTAAT ACCCAGGTTC
GGAAGCAATA TGTTTTTATC ATATGATAAA GATTCCATCT CTGTATTAAC AGAAAAAAGA
AAACAGCTCT GGCAATACGT AGAAAATGCA ACTTTCATGA CTATCAATGA AAAAAGAGCA
GCTTTTGGGT TACCACCAAT AGAGAATGGA AATACTCTAT AA
 
Protein sequence
MNFFKKKSIQ DNSYTFSVPI QLFTEAVWKN RSYANFAENG YIKNVIAFRS IHMIASAAAS 
VSLLLNKTIK NNTFQIKNHP LLKLISKPNN TTSKSEFIEG ILTYKLISGN AYILTIENHD
MIPKELHLLR PDRIEITPGK DNRPYSYRYS INNYHYDYKI NKSTNYSQIL HIKNFHPLND
YYGLSPIEAA SYSIDQHNQA GSWNQAMLQN GARPSGALIV NAKSNNNGNL TQEQYTRLKS
QVDEFYSGPR NAGRPILLEG GLDWKEMSLS PKDMDFIESK HSSARDIALA FGVPPQLLGI
PGDNTYNNLI EARLSLWEQT ILPHLDNIIS HFNNWLIPRF GSNMFLSYDK DSISVLTEKR
KQLWQYVENA TFMTINEKRA AFGLPPIENG NTL