Gene Information |
Locus tag | ECH_0031 |
Symbol | tlyC |
ID | 3927904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 27362 |
End bp | 28249 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637901156 |
Product | hemolysin |
Protein accession | YP_506864 |
Protein GI | 88658322 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.551744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATA AACTTTCTGA TAATAGTGAT AGTGAAGATA GGTTATCTTT AAAAGAGTCG TTAAGCGCTA AGTTAGTTTC ATTTATGATA GAGAGAATGC CTAAATTAAA AAGGGTTGTC GAAAACAATT TGATTGATGA TAACGAGTGT TTCACCAACT CCCGCATGTT TTATAATCTT AGAAAGTTCA ATGATTGCTT GGTGAGAGAT GTAATGATTC CTAGAACAGA AATTTATGCC ATAGACATTA CAGATGTACC TGATAGCAAA AACTTGATAG ATAAGGTAGT GAGCGGGCAG TACACCAGAA TTCCAGCATA TGAGAACAAT TTGGATAACA TAATAGGGTT TATTCACATT AAAGACATAA TTTCAAATTT TCATAATGAT TTTAATGTGA GAAATATAAT TCGTGAGGTT ATGTATATCC CTCCATCTAT GAAAGCAGTC AACTTATTTA TAAAAATGCA ATCTTCTCAC ATACATGTTG CTGTAGTAGT AGATGAGTAT GGTGGAACAG AAGGATTAGT GACTATGGCG GATCTTATTG AAGAAATAGT AGGTGATATA GACGATGAGC ACGATGTTCC CACTGTTCCA AGTATTGTTA ATATCTCTGA TAATAAAATT GAAGTAAATG CCAGGGTATT AGTTAAAAAC TTGGAAGAAA TTTTCAATAT TGATTTTAGA GATTGTAAAG AAGATGATTA TGTAACAATA GGTGGACTCA TTTTATCTAT GATCGGTAGA GTACCTATGA CAAATGAAGT TTTTAAACAT AAAAGTGGAG CTGTATTTTC CATTAAAGAG GCTGATGATA GGTGTATTTA TAAAATAGTT ATTGATTTAA GTGATGTAAA AAAAGTTGAC TTAAAAATCG ATTATTGA
|
Protein sequence | MSDKLSDNSD SEDRLSLKES LSAKLVSFMI ERMPKLKRVV ENNLIDDNEC FTNSRMFYNL RKFNDCLVRD VMIPRTEIYA IDITDVPDSK NLIDKVVSGQ YTRIPAYENN LDNIIGFIHI KDIISNFHND FNVRNIIREV MYIPPSMKAV NLFIKMQSSH IHVAVVVDEY GGTEGLVTMA DLIEEIVGDI DDEHDVPTVP SIVNISDNKI EVNARVLVKN LEEIFNIDFR DCKEDDYVTI GGLILSMIGR VPMTNEVFKH KSGAVFSIKE ADDRCIYKIV IDLSDVKKVD LKIDY
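The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes its terminal stop codon (the sequence above ends in TGA). The function names are hypothetical and are not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop_codon else codons


# Values taken from the record for ECH_0031 (tlyC).
length_bp = gene_length(27362, 28249)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the sketch reproduces the listed Gene Length (888 bp) and Protein Length (295 aa).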