Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1320 |
Symbol | |
ID | 8807086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1406998 |
End bp | 1408011 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003460564 |
Protein GI | 289208498 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCCC GACAACATTT TCGCCTGGGA TTGTTCATCG TCGGTGGCCT GGTGGCGCTG GTGACCGCCC TGTTCATCAT GACCGCGGGC AATATCTTTC GCGCCTCGAT CCCGATCGAG ACCTACATCG ATTCATCCGT GCAGGGGTTG GAGATCGGAG CGCCGGTCAA GTTTCGAGGT GTGACCATTG GCGAGATCAC GAACCTGGGT TTTACCTCGG TGGAATACCA GCAGGACGTG GACCCTCGTG AGCGCAAACG CTATGTGATG GTCGAGGCCC GGCTCTGGCC GGACCGCTTC GCGGCGAGCG CCCGCGAGCA GGATTTCGAG GCGGATGTCC TGAAGAACCT GGTGGATGCG GGCCTGCGAG TGCGCATTGC CGCACAGGGC ATTACCGGTA TGAACTATCT GGAAGCGGAC TTCTCCGATC CGGACGAACA CCCGCCGCTG GAGCACGACT GGGAACCGCG GTCGATCTAC ATCCCGTCGT CCCCGAGTAT CGCCGTGCAA TTCATGGAGT ACGCCGAGAA TCTGCTTAAG CGGATCGACG GGCTGGACAT CGAGGGTGTG ATCGAGAATC TGAATTCTTT GCTGGTGACC GTCGACGATA CCGTGTCCAG TCTCGATACC GGAGGACTGA ACCAGCGAGC GGATGATCTG ATCTCGGAAC TCGAGCAGAC CATGCGCACG GCCGATCGGG TGATGCAATC GGTCGAGGCC CTGGTCGAGC ATCCGGACAC GCAGGCCCTT CCGAGCGAAA CCCGCAAAGC GATCCAGGAG CTGCGTCGAA CGGCCGAGGC AGCCGATGTC GCGGGCCTGG TCGACCGCAT GGACTCGATG GTTGAACGAC TCGATCGGGG GATGGATGTC AGCGAGTCGA AACTGCTGGA AACCCTGGAC GAGCTGCAGG GCGCGATCTC GGGCCTGCGC AGCCTGTCGG ATGATGTACG CCGCAACCCC GGTGGTGCGC TGTTCGGTGC CCCGCCACCG CGCAGCCGCA TGGACGAGGA GTAA
|
Protein sequence | MSARQHFRLG LFIVGGLVAL VTALFIMTAG NIFRASIPIE TYIDSSVQGL EIGAPVKFRG VTIGEITNLG FTSVEYQQDV DPRERKRYVM VEARLWPDRF AASAREQDFE ADVLKNLVDA GLRVRIAAQG ITGMNYLEAD FSDPDEHPPL EHDWEPRSIY IPSSPSIAVQ FMEYAENLLK RIDGLDIEGV IENLNSLLVT VDDTVSSLDT GGLNQRADDL ISELEQTMRT ADRVMQSVEA LVEHPDTQAL PSETRKAIQE LRRTAEAADV AGLVDRMDSM VERLDRGMDV SESKLLETLD ELQGAISGLR SLSDDVRRNP GGALFGAPPP RSRMDEE
|
| |