Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1084 |
Symbol | |
ID | 8806844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1147988 |
End bp | 1149964 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Aminopeptidase N-like protein |
Protein accession | YP_003460332 |
Protein GI | 289208266 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGG CATGGTGGAT CTCGATTGGC CTTCTGGCGT CGCTGGGCTT CGTGGTCGCT TCGGCGGCCG GTAACGACTA CGCCGAACTC GATTCCGACT TCCCGATCGG CCTGAACGTG CACCTGCACC CGGGCGAATC GAGCTTCGAC GGGCGTCTGG TGCTGGACCT GGAGCTCCTG GACCGGAGCC CGCCGAGCCG ATTGCGCCTT GATCCCGCCA TGGGGATCGA CGCCGTAACG CTGGATGGCG ATTCCATCGA ATACGAGTTG ACCGAGTCGG GGTGGCTGGA GCTGGGCGAT GCCCTGACGA AGCGGGGTAC CGGCCTCCTG GTCGTCGAGT ATGGCGCGCG GTTGTCGCAG CCTGACCAGG CGCGGCCGGG ATCGGGATTC CTGGGCGAGG AGGCCGGGTT CCTGCCGGCT GGAGCCGGCT GGTATCCCGC GCTGGAGGAC GAGCCCGTTC CCATGCGGAT TCAACTGACG GTGGACGGTG GCCAGAAGGC CGTGGTCTCC GGCTCGCTGA TCGATGACAC GCGCGTGGGC GAGCGCTATC AAGCCCTTTA CGAACACCCC GCTGCGCGCC ATTTGGCGCT CGCCTCCGGC ATTTGGGAGT TGGGCGTGAT CGAGGGCGAT GGGTGGCAGG TGCGTACCCT GTTCCCGCAG TCGCTGCACG AGGCCTTTGG CGAGGCCTAC CGCGAGAAGA CCGCCGAGTA CATCGAGCGC TTCGAATCCG AGATCGGGCC GTATCCGTTC AAGAGCCTGA CCGTTGCGGC CTCGCCGCAA CCGGTCGGAC TGGCGTTCTC CGGCTTTACC CTGCTGGGCG AACAGGTGAT CCCGCTCCCG TTCATCCCGC ATACCTCGCT GGGGCACGAG GTCCTGCACA TGTGGTGGGG TACGGGGGTC TACGTCGACC GTTCGGGCGG GAACTGGTCG GAAGGCCTGA CCACCTACAT GGCGGATTAC CAGTTCAAGC GGGAGGCAGG CGAGGGCCGC GAGGAGCGGG GGCAGTGGCT GCGCGACTAT GCGGCGCTCC CGGCGGACGA AGATCGCGCA CATGGCGAGT ACCAGGGCGG CAACCAGGGC GCGCTGCGCA TCGCCGGCTA TCACCGTGGT GCGATGATCT GGCGGATGCT GGAAGACTGG GTCGGGCGTG ACGCGTTTCT GGAGGGTGCT CGCACGCTCT ACAGCGACTG GAAAGGGCGC GAGGCAGACT GGAATGCGCT GATCGCGGCC TTCGATTCCG CTACCGACGA GGATCTGGAG CCGTTCTTCC GCCAGTGGAT CGAGCGGCGC GGCGCGCCGG GGCTCGAGCT GGCCGATCTC GCCGTCGACC AGGACGAGAA CGGGTGGACG CTGTCGGGCA CGTTGCGTCA GTCCGGCGAT GACAAACCCT GGGATCTGCG GGTACCGCTT GTGGTCGAGA GCGAACAGGG GTCGGAGACG TTCTGGCTGG CACTGGCCGA GAGCGAAAAG TCGTTCGTGG TGGAGCTGGA TGCCGAGCCG TTGCGCGTCG CCGCGGACCC GGACTGGCGG CTGTTTCGCC ATCTGGCCGA AGACGAATCC CCGGCGATCC TGCGCCGCGC CGCACTGGAC CCGGACACGC AGGTGATCGC GCTGGGTTCG GCGCGGCCGG TCGAGGTCGC GCCCTGGCTG GGGCGCGCAG CCGAACAGAG CCAGAACGTG GAAGGGACCC GCATCGTACT GGGCGAACCC AAATCGGTGC GCGACTGGCT CCGCGCCCGC GACTGGCCGG AATCGCCGGC GGCGATCGAC AGCGATCTGC CCGAGTCCCC GGACGCCGCC GCCTGGGCCG TGCCCGGCGA GGATGTGGTC GTGATTGCGG GTCCCGATGC GCAGCAACGC CGCGCACTGG CGGGCGAGCT GCGCCACCGG GCGCACTACA GCTATGTCCT GCTGGAAGGG GACCAGCGCA GCACCGGGCA CTGGGCCACG CCGGGGATCT GGAAGGAACT GCAATGA
|
Protein sequence | MRAAWWISIG LLASLGFVVA SAAGNDYAEL DSDFPIGLNV HLHPGESSFD GRLVLDLELL DRSPPSRLRL DPAMGIDAVT LDGDSIEYEL TESGWLELGD ALTKRGTGLL VVEYGARLSQ PDQARPGSGF LGEEAGFLPA GAGWYPALED EPVPMRIQLT VDGGQKAVVS GSLIDDTRVG ERYQALYEHP AARHLALASG IWELGVIEGD GWQVRTLFPQ SLHEAFGEAY REKTAEYIER FESEIGPYPF KSLTVAASPQ PVGLAFSGFT LLGEQVIPLP FIPHTSLGHE VLHMWWGTGV YVDRSGGNWS EGLTTYMADY QFKREAGEGR EERGQWLRDY AALPADEDRA HGEYQGGNQG ALRIAGYHRG AMIWRMLEDW VGRDAFLEGA RTLYSDWKGR EADWNALIAA FDSATDEDLE PFFRQWIERR GAPGLELADL AVDQDENGWT LSGTLRQSGD DKPWDLRVPL VVESEQGSET FWLALAESEK SFVVELDAEP LRVAADPDWR LFRHLAEDES PAILRRAALD PDTQVIALGS ARPVEVAPWL GRAAEQSQNV EGTRIVLGEP KSVRDWLRAR DWPESPAAID SDLPESPDAA AWAVPGEDVV VIAGPDAQQR RALAGELRHR AHYSYVLLEG DQRSTGHWAT PGIWKELQ
|
| |