Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2561 |
Symbol | |
ID | 8808345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2699407 |
End bp | 2701362 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF699 ATPase putative |
Protein accession | YP_003461786 |
Protein GI | 289209720 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.00750004 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGACAGG CAGAGGACAG ACCCACCCTT GGCTGGCGGC GGCTTCTGGC CCACCCGATG TCGCAGGCGG CGCGCGAGCG TGCCGCCACC TGGCTCGGGC GGATGCCCAG CGGGAAAAGC TCGCTCTGGG TCGGCCGCGA GGCACCAGCG GACGTGGCGC GGGTAAACCC CGCGCAGGCT GGTCACTGGC TGGGACGCGA GGCCGACGCG ATCGTGTTCG AGGCCTCCTG GCCCCTCCCG GCAGACGCCC TCGCCATCGC CGGGGGACTG CTGCGCGGGG GCGGCGTGCT CCTGCTGCTG GTCGATCCAC CTGGCGACGG GGCGTTTGGC CGGCGTTGGG CGAACGCCCT GCGGCACGCG CCCGTTCACT GGGTGAATGA CGATGTCGAG CAATGGCCAA TCCCCGACGC GGGCGTACAG CCTCCCTGGG CCTGGACGGA AGATCAGCGG CGCGGGCTCG TGACCCTGGA TGAACTTGGC CCCGGGGAAT GTGGCGCACT GGTTGCTGAT CGTGGACGTG GCAAGTCGAC CTTGCTGGGG GAGTGGATCG CCTGCCAGCG CAGCGCCGGC ACACCGGTCG TCGTCACGGC ACCCGGCGCC GGCGCCGTTC GGGCGCTCTT CCGTCAGCTG GAACGCCACC CGGGGCCGTC GGTACCTTTC TGCTCCCCCG ACCAGATCGA AGCCCTCGCC ACCGCACCCG ACACCCTGGT GATCGACGAG GCGGCGGCGC TGCCGGTCGA CCGCCTGGTG CGGCTGGCGG GACGTGCCCG GCGACTGGTG CTGTCGACGA CTACCGGCGG CTTCGAGGGC AGCGGACAGG GCTTTCGCCT GCGCGCGCTT CCCGCCCTGC AGCGCATGGG CTTCCGAATT CGGCAGATTG GGCTTGAGCA GCCCGTGCGC TGGGAGAGTG GCGACCCGCT GGAGGCCTGG CTGGATGACC TGTTCCTGAT GCGGGCCAGG AGTCGCGCAC CCGCCCGTCA GGCCGTGATA TTTCACTGGC TGTGTGGCAA GGGCCTCGTC TGTGACACGC GTCGGCTCGA GGACGTGGCC GGCCTGCTCG CCGACGCCCA CTATCGCACA CGCCCGTCGG ATCTGGCCCG CTGGCTGGAC GATCCTGACG CGTACCTGAT GCTGCTCCTC GGTGCGCAGG ACCGGGCGCT GTTCGGGGTT GCCCTGGTCC AGAAAGAGAG TGGCCTGGCG CCCGAACTCG CGGAAGCGGT CTGGGCCGGG GAGCGGCGCC CCCAGGGACA CTTCCTGCCC TGCACACTGG CTGCTCGCGG GGAGTTCGCT CTGGCCACCC GGGCCTGGTG GCGAATCCAA AGGCTCGCCG TACACCCCGC CTGGCAAGGG AGAGGGCTTG GGGGCCGCCT GCTGCGCGCA GTCGAGGAAT GCGCGGCACA CCACGACATC GCACTGCTTG GCACGAGCTT CGGACTGCAA CCGGCACTGG TACGGTTCTG GGGACAGGCA GGCTGGCAGC CGGTGCGGGT CGGGGAACGA CCAGATCCCG CCAGCGGCGA GGTGTCGATC GTGCTGACCC GGGCGTTGAC ACAAGACACC GCTGCCCTCG TCGATGCGGC CGCAGCGTCT TTCGCGCGCG ACTGGCGCGA GGGCGGCGCC GCGCTACTGC GCGATCAGAC GGGCCGGCGA CGCCGGGTCC TTGAAGCCAT CCAGCCGTCC GCCGTCACGG CACCCGACCG GGATCACGCC CGGGACATCG AAGAGATTCG AGCCTTTGCA CGGCGGCAAC GACCGCTCCA ATGGATACGG GCCGCCCTGC GACGCTCCCT TGTTGCACAC CCGCCGCAAG GACCGGAAGG AAGGCTGCTG CAATCCGCCA TCGAGTCCCT GGATGACGAG GCGCTGGCCC GTGAACTGGG GTTTTCCGGA CGGCGGGAGG CCGTTCGACG CCTGCGCCAG GCCGCGCAGA CCTGGCTGGC GCACCCCGAG GCGTGA
|
Protein sequence | MRQAEDRPTL GWRRLLAHPM SQAARERAAT WLGRMPSGKS SLWVGREAPA DVARVNPAQA GHWLGREADA IVFEASWPLP ADALAIAGGL LRGGGVLLLL VDPPGDGAFG RRWANALRHA PVHWVNDDVE QWPIPDAGVQ PPWAWTEDQR RGLVTLDELG PGECGALVAD RGRGKSTLLG EWIACQRSAG TPVVVTAPGA GAVRALFRQL ERHPGPSVPF CSPDQIEALA TAPDTLVIDE AAALPVDRLV RLAGRARRLV LSTTTGGFEG SGQGFRLRAL PALQRMGFRI RQIGLEQPVR WESGDPLEAW LDDLFLMRAR SRAPARQAVI FHWLCGKGLV CDTRRLEDVA GLLADAHYRT RPSDLARWLD DPDAYLMLLL GAQDRALFGV ALVQKESGLA PELAEAVWAG ERRPQGHFLP CTLAARGEFA LATRAWWRIQ RLAVHPAWQG RGLGGRLLRA VEECAAHHDI ALLGTSFGLQ PALVRFWGQA GWQPVRVGER PDPASGEVSI VLTRALTQDT AALVDAAAAS FARDWREGGA ALLRDQTGRR RRVLEAIQPS AVTAPDRDHA RDIEEIRAFA RRQRPLQWIR AALRRSLVAH PPQGPEGRLL QSAIESLDDE ALARELGFSG RREAVRRLRQ AAQTWLAHPE A
|
| |