Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2515 |
Symbol | |
ID | 8808299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2644814 |
End bp | 2646745 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | exosortase 1 system-associated amidotransferase 1 |
Protein accession | YP_003461741 |
Protein GI | 289209675 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0685531 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGGTA TCGCCGGTAT TTTCGATCTG AACGGGCGGC GCCCGATCGA TCGCGCCTGC CTGGAGGGCA TGAATGCGAC CCAGTTCCAC CGGGGCCCGG ACGAAGGTGG AATCCACCTG GAACCGGGGG TCGGGCTGGC GCACCGACGC CTGTCCATCA TCGACCTCAA GGGTGGCCAG CAGCCGCTGT TCAACGAGGA CGGCAGCGTC GTCGTCACCT ACAACGGCGA GATCTACAAC TTCCCGGAGC TGACTCACGA ACTCCAGCGG GCGGGCCATC GCTTTCGTAC CCACTGCGAT ACGGAAGTCA TCGTGCACGC CTGGGAGGAA TGGGGCGAGG ACTGCGTCCA GCGCTTTCGC GGGATGTTCG CCTTCGCCCT CTGGGATCGC CACACCCAGA CCCTGTTTCT GGCGCGGGAT CGCCTGGGCA TCAAGCCGCT TTACTACAGC GAACTCCCGG ATGGCCAACT GATCTTTGGC TCCGAGCTCA AGGTCCTGCT GGCCCATCCC GGGCAGACGC GCGAGCTCGA CCCCTGCGCG GTCGAGGAAT ACTTCGCCTA TGGCTATGTG CCCGAGCCGC GCAGCATTCT GTCCGGCGTC CACAAGCTGC CACCGGGGCA TACGCTGAGC GTACGGCGCA ACCAGCGCTC CGCCGCCAGT CCGCGCGCGT ACTGGGATGT CCCCTTTGCG ACAGCCGCGC CCATCGGCAT GTCGGAGGCG GCGGAGGAGT TGACCGAACG TCTGCGCGAG GCCGTGCGCA TCCGCATGGT GGCCGAGGTA CCGCTGGGGG CCTTCCTGTC GGGGGGCGTG GATTCCAGCG CCGTGGTGGC GATGATGGCC GGGCTCTCCT CCGAGCCCGT GCGCACCTGC TCCATCGGCT TTGCCGACCC CGCCTATGAC GAGTCACGCT ATGCCCGGGA GGTCGCCGAG CGCTACGCAA CGGACCATCA CAACCGCGAG GTCGACCCCG ATGACTTTGC CCTGCTGGAG CGTCTGGGCG ATCTCTACGA CGAGCCGTTT GCCGACAGCT CGGCCCTGCC GACCTACCGC GTCTGCGAGC AGGCCCGCCG ACAGGTCGTG GTCGCGCTGT CCGGCGACGG GGGCGACGAA ACCCTGGCGG GCTATCGCCG CTATCGGCAC CACCTCGCCG AGGAACGCCT GCGCAACCCG CTGCCCCTGG GCCTGCGGCG CGTGCTGTTC GGCAGCCTGG CCCAGGTCTA CCCCAAGGCG GACTGGGCCC CGCGCCCGCT GCGCGCTCGG GCGACCTTTC AGGCACTGGC ACGGGATGCG GTCGAGGGCT ACTTCCAGGG CGTGTCGGTG CTGCGCGACG AGCAGCGCAG CACGCTGTTC TCGTCCGCAT TCCGCCGCGA GCTGCAGGGC TACGGGGCGG TCGAGGTGCT GCGCGCCCAT GCCGGGCGGG CCCCGACCGA TGACCCGCTC TCGCTGGTGC AGTACCTGGA CCTCAAGACC TACCTCCCGG GCGACATCCT GACCAAGGTC GACCGCGCCA GCATGGCTCA CTCGCTGGAG GTCCGCGTGC CGCTGCTGGA TCACCCGCTG GTGGAGTGGG CCTCCACCCT TGCACCGGAA CTGAAGCTGC GCGATGGCGA AGGCAAGGCG GTTCTGAAAA AGGCGATGGA ACCCTACCTG CCCCACTCGA TTCTCTATCG GCGCAAGCGG GGCTTCGCCG TACCCCTGGC CAACTGGTTC CGGGGGCCGT TGCGCGAGGT GATGCACGAG CGGGTGCTGG GGGAGCAGAT GCGCGATTCC GGGCTGTTCG ACGAACGCAC CCTCAGGCGT CTGGTCGACG AGCACGGCCG GGGCGCGCGC GATCACAGCG CGGCGCTGTG GTCATTGATG ATGTTCCAGA CCTTTCTGGA CAAGGTGCAA CCGACCAGCC CGCGCAGCGG CTGTGTCCTG CCTGGAGCCT AG
|
Protein sequence | MCGIAGIFDL NGRRPIDRAC LEGMNATQFH RGPDEGGIHL EPGVGLAHRR LSIIDLKGGQ QPLFNEDGSV VVTYNGEIYN FPELTHELQR AGHRFRTHCD TEVIVHAWEE WGEDCVQRFR GMFAFALWDR HTQTLFLARD RLGIKPLYYS ELPDGQLIFG SELKVLLAHP GQTRELDPCA VEEYFAYGYV PEPRSILSGV HKLPPGHTLS VRRNQRSAAS PRAYWDVPFA TAAPIGMSEA AEELTERLRE AVRIRMVAEV PLGAFLSGGV DSSAVVAMMA GLSSEPVRTC SIGFADPAYD ESRYAREVAE RYATDHHNRE VDPDDFALLE RLGDLYDEPF ADSSALPTYR VCEQARRQVV VALSGDGGDE TLAGYRRYRH HLAEERLRNP LPLGLRRVLF GSLAQVYPKA DWAPRPLRAR ATFQALARDA VEGYFQGVSV LRDEQRSTLF SSAFRRELQG YGAVEVLRAH AGRAPTDDPL SLVQYLDLKT YLPGDILTKV DRASMAHSLE VRVPLLDHPL VEWASTLAPE LKLRDGEGKA VLKKAMEPYL PHSILYRRKR GFAVPLANWF RGPLREVMHE RVLGEQMRDS GLFDERTLRR LVDEHGRGAR DHSAALWSLM MFQTFLDKVQ PTSPRSGCVL PGA
|
| |