Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0892 |
Symbol | |
ID | 3927376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 917311 |
End bp | 918612 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 637902009 |
Product | CBS domain-containing protein |
Protein accession | YP_507687 |
Protein GI | 88658382 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4536] Putative Mg2+ and Co2+ transporter CorB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0338077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTAT TAGTAACTTC AGTATTTAGT ATACTTATAC TACTAATTTT ATCTGCATTT TTTTCTGCTG CAGAAACAAG TATAACTTCA ATTAGTAGTT CACTTATCCA TAAATTAATG CTACAGGGTA ATAAAAGAGC CCAAATCATT AACACACTCA GTCAAAAGAA AAAGCTTGTT ATAAATACTG TATTAATAGG CAATACTATT ATAAACATCA CTGCTTCTTC TATTGCAACA GCAATTTCTA TTGAAATTTT AGGACCACAG GGGATATTAT TTTCAACCGT CATTATGACA TTGTTTATAC TGATATTTTC TGAAGCATTA CCAAAAAGCT ATGCAATACT CAATCCAGAA AAAATTGCTT TAATGATATC GTGTCCCTTA TCATGTTGCG TACTCATTTT ATCCCCCATA ACACTATCAA TACAATATAT GATAGATTGT ATTTTAAAAA TTCTAGACAT GCATAAAGAT AAAGAAATCA TTTCAGCAGC AGAAGCTATG AGAAATTTAA TCTCACTACA TGATAGTAAA GGAACCATGC TAAAACAAGA TTTAGACATG TTGAGTAGCA TATTAGATTT AGCAGAAACA GAAATTTCAC AAGTAATGAC CCACAGGAAA AACATATTAG CTTTTAATAT AGATACAAAT ATAAATGATC TAATAAAAAA AATATTAGCA AGCTCTCATA GTAGAATACC ATTATGGAAA AATCAAGAAG ATCAAATCGT AGGAGTAGTA CATGTTAAAG ATGTAATAAC GCTAATACGG GAAAAAGGCA AAAATATTAC TCAAGAAGAT CTTCATAAAG TAATGACAAA ACCATGGTTT GTTCCAGATA CAACTCTGTT GAGTGTTCAG CTTCATAACT TCTTAAAAAA TAGAAGACAT CTTGCTTTAG TAATAGATGA GTACGGAGCA TTACAAGGAA TAGTAACATT AGAAGATGTT ATTGAAGAAA TAGTTGGAGA TATTACAGAT GAACATGATA TTACAACAGA AGCTCCAATA AAACAAATTT GCGAAAATAT ATATCATATT AATGGTTCTA CTTCCATTAG AGATATTAAT AGGCAGTTAC GTTGGAACTT ACCTGATGAA GAAGCATCAA CATTGGCAGG AGCAATTGTG TATGAAGTTG AACGTATACC TGAAGAAGGC GAGGAATTTT TACTATACGG ATTGTCATTT AAAATATTAA AGAAAAGTGG TCATACCATT TCTAGTATTC AAGTTGATAC ATCTCCAAAA GATACATCTA CTATAAAACA CAAGACAGAA CAACAAACTT AA
|
Protein sequence | MGLLVTSVFS ILILLILSAF FSAAETSITS ISSSLIHKLM LQGNKRAQII NTLSQKKKLV INTVLIGNTI INITASSIAT AISIEILGPQ GILFSTVIMT LFILIFSEAL PKSYAILNPE KIALMISCPL SCCVLILSPI TLSIQYMIDC ILKILDMHKD KEIISAAEAM RNLISLHDSK GTMLKQDLDM LSSILDLAET EISQVMTHRK NILAFNIDTN INDLIKKILA SSHSRIPLWK NQEDQIVGVV HVKDVITLIR EKGKNITQED LHKVMTKPWF VPDTTLLSVQ LHNFLKNRRH LALVIDEYGA LQGIVTLEDV IEEIVGDITD EHDITTEAPI KQICENIYHI NGSTSIRDIN RQLRWNLPDE EASTLAGAIV YEVERIPEEG EEFLLYGLSF KILKKSGHTI SSIQVDTSPK DTSTIKHKTE QQT
|
| |