Gene Information |
Locus tag | ECH_0523 |
Symbol | |
ID | 3927260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 522829 |
End bp | 524298 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 637901646 |
Product | hypothetical protein |
Protein accession | YP_507338 |
Protein GI | 88658285 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
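The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_gene_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinates against the reported gene and protein lengths.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    (both are assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons = span // 3                    # whole codons, including the stop
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "multiple_of_three": span % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record for ECH_0523.
result = check_gene_lengths(522829, 524298, 1470, 489)
print(result)
```

With the values in this record, the span is 524298 - 522829 + 1 = 1470 bp, which is 490 codons, or 489 residues once the stop codon is excluded.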
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0189949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATC TTTTTGTGAT TTCTGCTTTT ACTTCTCTGT TAATGATGTC TTCCTATAAT GCTTTTTCTG ATGAAATATT AGATGGTATT TTTGGAAGTA ATAATAAATT TATAAATAAT ACTAAGAATT CATTTTCAGG AATTAATAAT AAGGCGTTGG TAAAAGGTGG ACGTATTAAA TTTGCTGGTG ATATGATATC TTATACTTGG TATTCAAGTG ATGATACTAG GAATTCGAAT AAATTTAGTA GGGTTAGTAA GCGTTTTGAT ATAGATACGG GAGGGAATAT TAATAATGTT GGTGCTAAGC ATGATGGTAT GTTTAGTATT GAAATTGATT CAAATCCGGA TAAACATGGA ATTGTATATG GAGCATATTC TCAAATAAAT ATCCCACATG TCGCAGGAAA AAGTTTTGGT AATAATGCTG CATTTAATAG AGGATCAAAA ATATTTGCTA AAACTCCTTA TGGTAACTTT TCAGTTGGAT ATCAGGAAGG TGTAGAGTCC ATGATGAAAT TAAATGCTTT TAGTATAGTT GCTGGTGATG ATTCTAATAT ATGGACAAAG CATTTAAGGA ATATTCTGCA TGAAAAAAAA GATGGTCAAA GTGGTTATTC AGTATATTAT TTCAATTTTA ACTCTGGGTT ATATAGTGAA AGTTTATTTC GTAATAGTGA CAATATTGTT TTTGATGACA TAGACTATTA TTTAGGAACT GGTATTATTT CTAGGAGTTT TATTAATAAT TTGCCATTTA GGTTATCTTA TCAATCACAG AATTTTATGG GATTAAGGTT TGGAGTAAGT TACTCTCCTT TTGGGTATGA TCAGAGATTG TTTGAATTAC AAAAAGATCG AAACAGTGAT ACTTTAATTT TGGTTGGGCC AAGGTATAGA CATATTGTTA GTGGAGGTAT TTCTTATACT TATAATATTA AAAATTTAAA ATTTAGTGCT TCCGTGATAG GTGAATATGG TGATGAAGAG CATGATTATA AAGCGCACTA TAATAGATAT TATAGGCATA ATACATTAAA GGCAGTATCC ATTGGTTGGA ATGTTGGTTA CGATAAAATA GAATTAGCAG GGTCTTATGG AAAATTGAAT AGTGCTGGAA TTCCTTATGA TAAGTGTGTT ATACATGGAG TTCCTTATGA GTATGTATAT AGGAGTGTTA TTCACTGGCT GTATTTGAAA GACATGGATT ACTATTGGGA TATAGGTATT GCTTATAAGT ATGCACCTTT AAGTCTAAGT GTTATTTACT TTATGAGTAA TAGGGTTGGT AATGAATTAA GTGATGTAAA TGTAGGCATT GAATATGATA TTTTAAAATA TAGCGGTTTT AAGAGTAGTT TGTTTGCTAA TTACAATTAT TATACTTTTC GCCAATTTAG TGACACTTAT AGGATTCATG TGAATGGTAA GGGCAGTATA TTGTTAGTGG GTGCAAAGTT AAGTTTTTAA
|
Protein sequence | MKNLFVISAF TSLLMMSSYN AFSDEILDGI FGSNNKFINN TKNSFSGINN KALVKGGRIK FAGDMISYTW YSSDDTRNSN KFSRVSKRFD IDTGGNINNV GAKHDGMFSI EIDSNPDKHG IVYGAYSQIN IPHVAGKSFG NNAAFNRGSK IFAKTPYGNF SVGYQEGVES MMKLNAFSIV AGDDSNIWTK HLRNILHEKK DGQSGYSVYY FNFNSGLYSE SLFRNSDNIV FDDIDYYLGT GIISRSFINN LPFRLSYQSQ NFMGLRFGVS YSPFGYDQRL FELQKDRNSD TLILVGPRYR HIVSGGISYT YNIKNLKFSA SVIGEYGDEE HDYKAHYNRY YRHNTLKAVS IGWNVGYDKI ELAGSYGKLN SAGIPYDKCV IHGVPYEYVY RSVIHWLYLK DMDYYWDIGI AYKYAPLSLS VIYFMSNRVG NELSDVNVGI EYDILKYSGF KSSLFANYNY YTFRQFSDTY RIHVNGKGSI LLVGAKLSF
|
| |
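The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the database's own method: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, the codon table is written out in the usual TCAG ordering, and only unambiguous bases (A, C, G, T) are handled. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts; since this gene begins with ATG, that difference does not matter here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAAAAATC TTTTTGTGAT TTCTGCTTTT"
print(translate(prefix))  # MKNLFVISAF
print(round(gc_fraction(prefix), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content of 30%.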