Gene Information |
Locus tag | TK90_1103 |
Symbol | |
ID | 8806863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1172951 |
End bp | 1176112 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | integrase family protein |
Protein accession | YP_003460351 |
Protein GI | 289208285 |
COG category | |
COG ID | |
TIGRFAM ID | |
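The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts a terminal stop codon that the protein length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1172951, 1176112)
print(length, protein_length_aa(length))  # prints: 3162 1053
```

Under these assumptions the computed values agree with the Gene Length (3162 bp) and Protein Length (1053 aa) rows of the record.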
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0496625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTGACG CCCCGGAAAC CCAGCAGGTC TGTTCGCTCT GGCGTCACCT TGTGGAGCCG ATGGAAGACC TGTTAGCCCA CAGGGATCAT CTCTGGTGCG TTCGCGACGA TCTTTCGCGG GATACCGGTT CGGAGCTTGG CAGAGTCATT TCGGCCATCG AAAAGCATCA CCCGGAGGTC CTTAGGCGAG ACCGGATTCT CAAGTCGGTC TGGTCCCGTG AACAGTTTGT GGCCCTGATG GACACGGTCC GTCGAGCGGC CCCGCGGGGG TCCTCTTTTC GGGCACGCTG GCGGGTAGCC GAAGTGCTCG ATGCTGGTAA TCGATTGGGT GTGTGGAAGA TCCCACTGTA TGACATTCCA GCTCCGCTCC CACCGCCCGG GAGACCGATT TTTCGCCCTG ACAGCTTCGA GCGCATGGTT CAGTACGAGC CGTTGTATGC GCGTTTCGTG GATGAGGTGA GCGCTGATCT TGAGCACCTA GATGACGGGC AGCTTGGATG GGGACAGATT CTCGGCTCCC TCCTGCTGTT TGGTGGTCTG GTGCAGCCAG CGTGGCTTGA CGCGGTCCCG AATTCGCTCA AACATGCGCC GGCGCACTTA CATTGGCTGG ATATCGAGCG GTCATTGCCC GACAACCAGC GACCGGTGAT TCGTCGGCAT TACCTGGATC CGATCACGCG TTCGGTGGTG ATGCACTGGA GAGACGATGG TTGGCCGGAG ATGCCTCCTG GTAAGGGGCG GAGCCCGGCG TTTCTCAGTC GCGTGATTGG TTCCTACTTA CGGCGTTTGG ACCCCAGCCT GCGGATTCCG GACAACTGGA CGGAATTGCG CGACTTCGTC GAAACCCGTC TGGCCTTGTA CGTCCCACCG CACCTGATGG GGTACGCCAC GGGCTTTTAC AACTCCGTAT CGTTGTCGCC GGAAGTAATT CAGCGAATAG AGTTCCCGCC TGGTCAGGTG CCTTCGGGCG AAGGACTCGT CACTGTTGAC TCTGAAGGCG TAGTGCCAGT AGGACAGTCG GCTGTTGGTG AAGATTCTGA GAGCCAGGGC AGGCCAATTG CCCCGCCGGC GGGCCCCTGG ATTCGCGAAT TGGGAAGTGC AATCCGTGGG GGATCGGCGG CGGACCCCAA ACGTGTGGCG GCCTGGCTTG AACGCCAAGC GAATGGTGAG GGATCTGAGC CTGACTCGGT ACCACCGAGT GTTGTGAAGA TGGGAGAGTG GGCCCAGCGT TGGCTGTTTT CGAGCCGTGC CGGTGCCCGA CCCATGAAAC CCAAGACGGC CTACGATCGC TTCAATGCAT TGGCCGTCCC TCTCGCTGGC TTTCTCGGCA ACGAGGATCC GGCCGCGTTC GAATGCGTGG ACGACTTCGT GGAGGTCTAC ACCGCCGTCC TTGAAACAGC CGATACTTTG TCAAAGCGCA AGCGGTTAGC CGCAGCTCTG GCGAGCTTTC ATGACTTCCT GCGGAACGCG CACGGGGCAC CAGATATTGC GTCGGCGGGT CTCTTCACGG TTCGCGGGCG CCAGCCCCAC GCCGTCGATG CGAATTTCAT CGAACCGCGC GCGTTTGAAT GGGCGGTACG CTGGTTGGAC TATCGTTACG CCGAGGATCC TGAGCTGCGG GAGAGCCTCG TTTTGATCGC GTGTCTCGGG TATTTCGCGG GCCTGCGGCG CTCTGAGGCG ATTGGGCTGC TCATTGGTGA TCTTGATGGC GAACCCGCCT GGGACTGCGT CGTGCGACCG AACCGAAATC GGGGGCTCAA AAGCTCAAGC GCGCACCGAG TCGTCCCACT GGGCGTGCTC 
CTGCCTGCGC GATACCTGAA CCGACTGCAG AAGTGGTGGG CTGAGCGTCG AAAAACGATG CTCGCGGAAG GTGGCGATCC TGCGACGGTG CCGCTGTTCG ACCGTCCCCG AAAAAAGGGA GCGGAAAACG ACAATCTGCG GCGCTTCGAT CGGGACCTGG AACGAGTTAC CGATGCCTTA ATACGGGTGA CGGGAGATGG GGGGTTGCGT TATCACCACC TGCGGCACTC GTTCGCCAAT CAGCTCCTGC TCGCGTTGTG GCGGCACGAG CACCGCGATG ATGCGCTGGT GGTCGAACGG CTTGATCCCC TGATCGGTTT TGAAGACGTC GCGACGTTAC GAACAACCCT CCTTGGCGAC TCCCCGGTTC AGCGCCGCAG CCTGCGGCTG ATCAGTGCCT TGATGGGGCA TCTGACGACC GAGATCACGA TGAATCACTA CATTCATCTG ACGGATCTTG TTTGGGGGCA AGCCGTGCGG GGAGCGCTTC CCCCGCTCGA ATTCCATGAT GTGGCTCGAA TACTGGGTGT GTCTCTCAAG CATGTTCAAC GAAGCGAGCG AACCTTCCAA ACGGGGAACC CGGTACTTCT GCTTGAAAGG ATGCTGGACC GTCATCTCGG CAGCCCTGTC CCTGCGGACG TGGATGCAGA AGAGCCCAAG CAGCTACTCC CGCCGCGTGA TCCTCATGCA GCACTCCTGT CCTTGACTGA CCACCTGAAT CGCACTTCGC AGGACGATTC GAAAGTCGTT CATTTCGTGG GGCCTTGGCA AGGGCCCACT CTCGATGTGT TGAGGGCCTG GCTGTCGGAC GTTCCGGAGC ACTTTCGCAG ACCTCCCACA GGGCGCCATG AACGGATCAT TGCGCCACCC CGGACTCAGC AGGGGCTGGG TTTATCGAAG GAAGCGGTCC AGTTACTCCT GAGTGTGAAA GGCGCCTGGG GCAAGGAAGA ACGCGCCCGC CTGCTTCGGA TCTTTCTGTC AGGGAAGCTG GCGAGCGGTC CGCTCGACGT TGTCCTGGCG ACCTTGCCCG CGCTGGATCT TTGGGTGCGG TTCCTCCGAG AGATCCAGCT CGATGATGCC TTCGAATACA TGCATTTCTC TGGTAAGGGT GGAGGCCGAG AAACGCCGAT GGGGCAGTAT CAATACTGGG CGCAGCGCGC ACCGGTCGCG CTCGAGTCTG GTGACGCCAA TCCCGAGCCG TTCGCGGAGC ATCTCCCCGA GAAACGAAGA GGCGTCGTGG TTGCCCGGCA CCGACGAAGT GGCGAGGCTG GCCGGCACCG CTGGGTGTAC GGGGTCTGTT GGGCACTCGT TATGCTGAAG GCGAACGACG AGCATCCAGA ACTGGCACTG CTGACGCCAT AG
|
Protein sequence | MSDAPETQQV CSLWRHLVEP MEDLLAHRDH LWCVRDDLSR DTGSELGRVI SAIEKHHPEV LRRDRILKSV WSREQFVALM DTVRRAAPRG SSFRARWRVA EVLDAGNRLG VWKIPLYDIP APLPPPGRPI FRPDSFERMV QYEPLYARFV DEVSADLEHL DDGQLGWGQI LGSLLLFGGL VQPAWLDAVP NSLKHAPAHL HWLDIERSLP DNQRPVIRRH YLDPITRSVV MHWRDDGWPE MPPGKGRSPA FLSRVIGSYL RRLDPSLRIP DNWTELRDFV ETRLALYVPP HLMGYATGFY NSVSLSPEVI QRIEFPPGQV PSGEGLVTVD SEGVVPVGQS AVGEDSESQG RPIAPPAGPW IRELGSAIRG GSAADPKRVA AWLERQANGE GSEPDSVPPS VVKMGEWAQR WLFSSRAGAR PMKPKTAYDR FNALAVPLAG FLGNEDPAAF ECVDDFVEVY TAVLETADTL SKRKRLAAAL ASFHDFLRNA HGAPDIASAG LFTVRGRQPH AVDANFIEPR AFEWAVRWLD YRYAEDPELR ESLVLIACLG YFAGLRRSEA IGLLIGDLDG EPAWDCVVRP NRNRGLKSSS AHRVVPLGVL LPARYLNRLQ KWWAERRKTM LAEGGDPATV PLFDRPRKKG AENDNLRRFD RDLERVTDAL IRVTGDGGLR YHHLRHSFAN QLLLALWRHE HRDDALVVER LDPLIGFEDV ATLRTTLLGD SPVQRRSLRL ISALMGHLTT EITMNHYIHL TDLVWGQAVR GALPPLEFHD VARILGVSLK HVQRSERTFQ TGNPVLLLER MLDRHLGSPV PADVDAEEPK QLLPPRDPHA ALLSLTDHLN RTSQDDSKVV HFVGPWQGPT LDVLRAWLSD VPEHFRRPPT GRHERIIAPP RTQQGLGLSK EAVQLLLSVK GAWGKEERAR LLRIFLSGKL ASGPLDVVLA TLPALDLWVR FLREIQLDDA FEYMHFSGKG GGRETPMGQY QYWAQRAPVA LESGDANPEP FAEHLPEKRR GVVVARHRRS GEAGRHRWVY GVCWALVMLK ANDEHPELAL LTP
|
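The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the database's own method. It assumes the sequence is read in frame from its first base, that the spaces in the record are only display grouping, and that translation stops at the first stop codon. The amino acid assignments used here are those of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored. Characters other than A, C, G and T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGTCTGACG CCCCGGAAAC CCAGCAGGTC"))  # prints: MSDAPETQQV
print(translate("CTGACGCCAT AG"))                     # prints: LTP
```

The two fragments reproduce the first ten residues and the last three residues of the protein sequence listed in the record, with the final TAG codon acting as the stop.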