Gene Information |
Locus tag | TK90_1894 |
Symbol | |
ID | 8807667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2010332 |
End bp | 2011597 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003461121 |
Protein GI | 289209055 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
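The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS includes its terminal stop codon (the gene sequence in this record ends in TAG). The function name `cds_lengths` is hypothetical, introduced only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon is not translated, so subtract one codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
gene_len, protein_len = cds_lengths(2010332, 2011597)
print(gene_len, protein_len)  # 1266 421
```

The result agrees with the Gene Length (1266 bp) and Protein Length (421 aa) rows.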
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGACCC GGCTGCACGA ACGCTTCGCG AACGATCCGG CCGCGCGCAT TGCGGAGGAT GCGCTGCGCG CCTGCGTGCA CTGCGGCTTC TGCAACGCGA CCTGCCCGAC CTACCAGGAG CTGGGCGACG AGGTCGACGG CCCGCGCGGG CGCATCTACC AGATCAAGGA AATCCTCGAA TCCGGCGAGG CCAACCCCAC CGCGCGCACC CACCTGGACC GCTGCCTGAC CTGCCTCAAC TGCATGACCA CCTGCCCCTC GGGCGTGGAC TACAACCACC TGGTCGCCTA TGGCCGGGAG GTGATCGAGC AGGACCTCGA TCGCAACTGG CGCGAGCGGA CCCTGCGCGG CCTGCTGGCC CGCACCCTGC CCGCGCGCTG GCCGTTCCGC CTGGCCCTGG GGCTGGGGCG CATGGTGCGC CCCGTCCTGC CCGCGACCCT GCGCCGCCGG GTACCCGCGC GGCAGGAAGC CCACGAAGCG ACACAACCGA TCCCTTCGCC AATCCTGGAA GGCTTTCCCC ACGGGCGTGT CCTGCTGCTA AGCGGTTGCG TCCATGATGC AATCGACGCG CGCACCAACG CCGCACTGAA GAACCTCCTG GGGCGCCTCG GCGTCGAGGT CGTGGAAGCC GGCGAGGCTC GGTGCTGCGG GGCCGTCGAG CACCACCTGG CTTTCGAGGA ACGCGCCCGG CAACGGGTGC GCGCCAACCT CGACGCCTGG CAACCCGCGC TCGACCAGGG CGTGGACGCG GTGGTCAGCA CCGCCAGCGG CTGCGGCGTG ATGCTGCGCG ACTACGGCCA CCTGCTGGCT GGCGATTCGG ACTACGCGGA ACGCGCACGC GGCGTCTCCG AGCGCGTACG CGACCCGGTG GAATTGTTCA GCCCGGACGC TATCCAGAAG CTGGGCATTC GCGCCCCGCA GGGCCACGAC CCGATCGCCT GGCACGCCCC CTGCACCCTG CAGCATGGCC AGCGCCTGGC CGGGCGCGTA GAACCGCTTC TGCAGGCGGC AGGCTTCGAA CTGGCACCCA CAGCCGAGCC TCACCTATGC TGCGGCTCGG CAGGGACCTA CTCGATCACC CAGCCAGCGA TGGCCGCCCG TCTGCGTACC CGCAAGCTGG ACAATCTGGA GGCCGCCGCA CCCGCGCGTA TCATAACGGC CAACGTCGGT TGCCAGACGC ATCTCCAGTC CGGGACCGAG ACGCCGGTCG AGCACTGGCT GACGCTGCTC GAGCGCCATT GTCCGGCAAC TCCCGCCGGT GCGTAG
|
Protein sequence | METRLHERFA NDPAARIAED ALRACVHCGF CNATCPTYQE LGDEVDGPRG RIYQIKEILE SGEANPTART HLDRCLTCLN CMTTCPSGVD YNHLVAYGRE VIEQDLDRNW RERTLRGLLA RTLPARWPFR LALGLGRMVR PVLPATLRRR VPARQEAHEA TQPIPSPILE GFPHGRVLLL SGCVHDAIDA RTNAALKNLL GRLGVEVVEA GEARCCGAVE HHLAFEERAR QRVRANLDAW QPALDQGVDA VVSTASGCGV MLRDYGHLLA GDSDYAERAR GVSERVRDPV ELFSPDAIQK LGIRAPQGHD PIAWHAPCTL QHGQRLAGRV EPLLQAAGFE LAPTAEPHLC CGSAGTYSIT QPAMAARLRT RKLDNLEAAA PARIITANVG CQTHLQSGTE TPVEHWLTLL ERHCPATPAG A
|
| |
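The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a minimal sketch like the one below. It is not the database's own pipeline. It assumes the sequence is supplied as text in space-separated blocks of ten, as shown above, and it uses the standard codon assignments, which translation table 11 shares with table 1. Table 11 differs in its permitted start codons, which the sketch does not handle; that does not matter here because this gene starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGAGACCC GGCTGCACGA ACGCTTCGCG"))  # METRLHERFA
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content rows of this record.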