Gene ECH_0299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0299 
Symbol 
ID3927821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp284058 
End bp286202 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content31% 
IMG OID637901423 
Productputative nitrogen regulation protein NtrY 
Protein accessionYP_507120 
Protein GI88658105 
COG category[T] Signal transduction mechanisms 
COG ID[COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.333561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGCT GGTTTATAAA GGTATACAGT AATATAAGTA GATTTCAGTT TTTTAAAATC 
GTTGCTTCGT TCTTAGCAAT ATCTGTTTTC TTGTTTTTGA TTATAACATA CTGTGTTGTT
TTTCATCATG ATGATCCCCT TGGGCCTGAT CATAGTAGGG CTATAAAGTT AGTATTCTTT
GATTTGATAT TATTTCTACT GTTGATTGCT AGTGTATCTC ATAAGTTGAT TAATATGTGG
ATACGAAGAA AAAGGGGACA TTTAGGTTCT CATTTGCAAA CAAAGATTAT TCTAATGTTT
TCAGTAGTAG CTGTTATTCC AACAATTGTG ATTTCAGGTT TTTCTACTTT GTTTTTTAAT
TATAGTATTC AAGCTTGGTT TAATAAAAGG GTTGAAGCTG TAATGCATGA GTCTATCCAA
GTGGCAGAGG CTTATTTAAG GGAGCATAAG AGAAATATTC GTTCTGATAT TTTAGCTATT
TCACATTATA TATCTGATCA CAAGATAATA TTAAATTATG ATACAGCAGC ATTGCAAAAT
GTTGTGCAAT CTAGGGCAGA TTTATTAGGA TTGGCTGAAA TAATAATTTT TGAACCAAGT
AGGATAGTAG CAAGTAATAA GTTTAGTTTT GTATTAAATT TTGATATGGT CCCATGGAAT
GACTTAGATA GAGTGGATAA TGATAGGATT GTTATTATAA TAAAAGAAAA TATGATACGT
GCTTTTCTTT TACTTGATAG ATTCTCATCT ACTTATATCA TGGTAAGTCG TATTATAGAT
AGAAAAGTTA TTTCTCATTT ATCAGCAACG AAAGGAGCAG TTAAATCATA TAGAGATCTG
CAATCACAGA TATCATCACT ACAAATACAG TTTTCTATGA TATTTATTTT AATATCGTTA
TTGTTATTAT TGACTGCTAT TTGGTACGGA ATTAATTTTT CTGGTGACAT AGTTCGTCCA
CTGTTGGATT TATTCTACGC TACACGTAAA GTTCAAAAGG GTGATTTATC TTTCAAGATA
GAAGAAGGAC GAGTTGGAGA AGAAATGTCT ACGCTTGCAC GTGCATTCAA TCAAATGACG
TCACAGCTTA GTAGCCAACG TTCACAATTG ATAAAACTTT ATCAAGATAT GAATGAACGA
AGAGAATTTA TAGAAGCTGT TTTGTCAGGT GTTTCTTCTG GAATAATAGC AGTAAACTGT
GCTGGTATAA TTACTTTAAT GAATGATAAA GCAAAGGAAT TATTAGCTCC TAATGATATC
CTACATGCAG AACTTGATGA TGTATTTCCT GAAATATCTG AATTGATTAA AAGCTCTAAA
ATGCATGATG AAATTACTGT TTTGAGAAAT AGAAAGTCTT TCACTTTATC TGTGAGAATA
AAATTGCTTG GTAGTGTTAA TAAAGGATAT ATTATTACAT TTGATGATAT TTCAAGTTTA
GTTGATGCAC AGAGATCAGC TGCATGGTCT GATGTTGCAA GAAGAATAGC TCATGAAATA
AAAAATCCTA TTACTCCAAT ATATCTTGCT GCTGAAAGAT TAAGTAGCAA GTATCAGCAT
GAAATTATTT CTGATAAGGA AAGTTTTGGA AGATATATTG ATACAATAAT TCGCCATGTG
ACGAGTATTA GGTGTATTGT TGATGAGTTT GCAAAATTTG CGAAAATGTC GGATCCTGTG
TTAATTAAAC ATGATATTTG TAGTGTAGTA AAAGAGTTGG CGTTTTCCGG GCAGTTTAGT
CGTGGTGTTG TGCATTATGA GTTAGATATA CCAAGTTATC CAATTTTTGT TGTATTAGAT
AAAAATCAGA TAAATCATGT ATTTATTAAC TTGTTCAAGA ATGCATGTGA ATCAATTGAT
ATGAAGTCTA ATAGTATACA GGGACTGATA AAAATTAGTG TAGTTGATGA TAGAGATTGT
GTTAATGTGC AAATTCAGGA TAATGGTGTA GGATTTCCAG AAGAACTTAT GGAAAAGCTT
ACAGAACCTT ATGTTACAAC TCGTGTCCAA GGAACAGGAT TAGGTTTATC CATAGTAAAA
AAGATACTAG ATGAACATAA TGCAACAATA AGTTTCTATA ATATAGAAGA TGGTGGTCTG
GTTAAGTTGA CATTTGTAAA ATGTCAGGAT TGTGAAAATA TTTAA
 
Protein sequence
MKGWFIKVYS NISRFQFFKI VASFLAISVF LFLIITYCVV FHHDDPLGPD HSRAIKLVFF 
DLILFLLLIA SVSHKLINMW IRRKRGHLGS HLQTKIILMF SVVAVIPTIV ISGFSTLFFN
YSIQAWFNKR VEAVMHESIQ VAEAYLREHK RNIRSDILAI SHYISDHKII LNYDTAALQN
VVQSRADLLG LAEIIIFEPS RIVASNKFSF VLNFDMVPWN DLDRVDNDRI VIIIKENMIR
AFLLLDRFSS TYIMVSRIID RKVISHLSAT KGAVKSYRDL QSQISSLQIQ FSMIFILISL
LLLLTAIWYG INFSGDIVRP LLDLFYATRK VQKGDLSFKI EEGRVGEEMS TLARAFNQMT
SQLSSQRSQL IKLYQDMNER REFIEAVLSG VSSGIIAVNC AGIITLMNDK AKELLAPNDI
LHAELDDVFP EISELIKSSK MHDEITVLRN RKSFTLSVRI KLLGSVNKGY IITFDDISSL
VDAQRSAAWS DVARRIAHEI KNPITPIYLA AERLSSKYQH EIISDKESFG RYIDTIIRHV
TSIRCIVDEF AKFAKMSDPV LIKHDICSVV KELAFSGQFS RGVVHYELDI PSYPIFVVLD
KNQINHVFIN LFKNACESID MKSNSIQGLI KISVVDDRDC VNVQIQDNGV GFPEELMEKL
TEPYVTTRVQ GTGLGLSIVK KILDEHNATI SFYNIEDGGL VKLTFVKCQD CENI