Gene TK90_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTK90_1103 
Symbol 
ID8806863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. K90mix 
KingdomBacteria 
Replicon accessionNC_013889 
Strand
Start bp1172951 
End bp1176112 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content61% 
IMG OID 
Productintegrase family protein 
Protein accessionYP_003460351 
Protein GI289208285 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0496625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGACG CCCCGGAAAC CCAGCAGGTC TGTTCGCTCT GGCGTCACCT TGTGGAGCCG 
ATGGAAGACC TGTTAGCCCA CAGGGATCAT CTCTGGTGCG TTCGCGACGA TCTTTCGCGG
GATACCGGTT CGGAGCTTGG CAGAGTCATT TCGGCCATCG AAAAGCATCA CCCGGAGGTC
CTTAGGCGAG ACCGGATTCT CAAGTCGGTC TGGTCCCGTG AACAGTTTGT GGCCCTGATG
GACACGGTCC GTCGAGCGGC CCCGCGGGGG TCCTCTTTTC GGGCACGCTG GCGGGTAGCC
GAAGTGCTCG ATGCTGGTAA TCGATTGGGT GTGTGGAAGA TCCCACTGTA TGACATTCCA
GCTCCGCTCC CACCGCCCGG GAGACCGATT TTTCGCCCTG ACAGCTTCGA GCGCATGGTT
CAGTACGAGC CGTTGTATGC GCGTTTCGTG GATGAGGTGA GCGCTGATCT TGAGCACCTA
GATGACGGGC AGCTTGGATG GGGACAGATT CTCGGCTCCC TCCTGCTGTT TGGTGGTCTG
GTGCAGCCAG CGTGGCTTGA CGCGGTCCCG AATTCGCTCA AACATGCGCC GGCGCACTTA
CATTGGCTGG ATATCGAGCG GTCATTGCCC GACAACCAGC GACCGGTGAT TCGTCGGCAT
TACCTGGATC CGATCACGCG TTCGGTGGTG ATGCACTGGA GAGACGATGG TTGGCCGGAG
ATGCCTCCTG GTAAGGGGCG GAGCCCGGCG TTTCTCAGTC GCGTGATTGG TTCCTACTTA
CGGCGTTTGG ACCCCAGCCT GCGGATTCCG GACAACTGGA CGGAATTGCG CGACTTCGTC
GAAACCCGTC TGGCCTTGTA CGTCCCACCG CACCTGATGG GGTACGCCAC GGGCTTTTAC
AACTCCGTAT CGTTGTCGCC GGAAGTAATT CAGCGAATAG AGTTCCCGCC TGGTCAGGTG
CCTTCGGGCG AAGGACTCGT CACTGTTGAC TCTGAAGGCG TAGTGCCAGT AGGACAGTCG
GCTGTTGGTG AAGATTCTGA GAGCCAGGGC AGGCCAATTG CCCCGCCGGC GGGCCCCTGG
ATTCGCGAAT TGGGAAGTGC AATCCGTGGG GGATCGGCGG CGGACCCCAA ACGTGTGGCG
GCCTGGCTTG AACGCCAAGC GAATGGTGAG GGATCTGAGC CTGACTCGGT ACCACCGAGT
GTTGTGAAGA TGGGAGAGTG GGCCCAGCGT TGGCTGTTTT CGAGCCGTGC CGGTGCCCGA
CCCATGAAAC CCAAGACGGC CTACGATCGC TTCAATGCAT TGGCCGTCCC TCTCGCTGGC
TTTCTCGGCA ACGAGGATCC GGCCGCGTTC GAATGCGTGG ACGACTTCGT GGAGGTCTAC
ACCGCCGTCC TTGAAACAGC CGATACTTTG TCAAAGCGCA AGCGGTTAGC CGCAGCTCTG
GCGAGCTTTC ATGACTTCCT GCGGAACGCG CACGGGGCAC CAGATATTGC GTCGGCGGGT
CTCTTCACGG TTCGCGGGCG CCAGCCCCAC GCCGTCGATG CGAATTTCAT CGAACCGCGC
GCGTTTGAAT GGGCGGTACG CTGGTTGGAC TATCGTTACG CCGAGGATCC TGAGCTGCGG
GAGAGCCTCG TTTTGATCGC GTGTCTCGGG TATTTCGCGG GCCTGCGGCG CTCTGAGGCG
ATTGGGCTGC TCATTGGTGA TCTTGATGGC GAACCCGCCT GGGACTGCGT CGTGCGACCG
AACCGAAATC GGGGGCTCAA AAGCTCAAGC GCGCACCGAG TCGTCCCACT GGGCGTGCTC
CTGCCTGCGC GATACCTGAA CCGACTGCAG AAGTGGTGGG CTGAGCGTCG AAAAACGATG
CTCGCGGAAG GTGGCGATCC TGCGACGGTG CCGCTGTTCG ACCGTCCCCG AAAAAAGGGA
GCGGAAAACG ACAATCTGCG GCGCTTCGAT CGGGACCTGG AACGAGTTAC CGATGCCTTA
ATACGGGTGA CGGGAGATGG GGGGTTGCGT TATCACCACC TGCGGCACTC GTTCGCCAAT
CAGCTCCTGC TCGCGTTGTG GCGGCACGAG CACCGCGATG ATGCGCTGGT GGTCGAACGG
CTTGATCCCC TGATCGGTTT TGAAGACGTC GCGACGTTAC GAACAACCCT CCTTGGCGAC
TCCCCGGTTC AGCGCCGCAG CCTGCGGCTG ATCAGTGCCT TGATGGGGCA TCTGACGACC
GAGATCACGA TGAATCACTA CATTCATCTG ACGGATCTTG TTTGGGGGCA AGCCGTGCGG
GGAGCGCTTC CCCCGCTCGA ATTCCATGAT GTGGCTCGAA TACTGGGTGT GTCTCTCAAG
CATGTTCAAC GAAGCGAGCG AACCTTCCAA ACGGGGAACC CGGTACTTCT GCTTGAAAGG
ATGCTGGACC GTCATCTCGG CAGCCCTGTC CCTGCGGACG TGGATGCAGA AGAGCCCAAG
CAGCTACTCC CGCCGCGTGA TCCTCATGCA GCACTCCTGT CCTTGACTGA CCACCTGAAT
CGCACTTCGC AGGACGATTC GAAAGTCGTT CATTTCGTGG GGCCTTGGCA AGGGCCCACT
CTCGATGTGT TGAGGGCCTG GCTGTCGGAC GTTCCGGAGC ACTTTCGCAG ACCTCCCACA
GGGCGCCATG AACGGATCAT TGCGCCACCC CGGACTCAGC AGGGGCTGGG TTTATCGAAG
GAAGCGGTCC AGTTACTCCT GAGTGTGAAA GGCGCCTGGG GCAAGGAAGA ACGCGCCCGC
CTGCTTCGGA TCTTTCTGTC AGGGAAGCTG GCGAGCGGTC CGCTCGACGT TGTCCTGGCG
ACCTTGCCCG CGCTGGATCT TTGGGTGCGG TTCCTCCGAG AGATCCAGCT CGATGATGCC
TTCGAATACA TGCATTTCTC TGGTAAGGGT GGAGGCCGAG AAACGCCGAT GGGGCAGTAT
CAATACTGGG CGCAGCGCGC ACCGGTCGCG CTCGAGTCTG GTGACGCCAA TCCCGAGCCG
TTCGCGGAGC ATCTCCCCGA GAAACGAAGA GGCGTCGTGG TTGCCCGGCA CCGACGAAGT
GGCGAGGCTG GCCGGCACCG CTGGGTGTAC GGGGTCTGTT GGGCACTCGT TATGCTGAAG
GCGAACGACG AGCATCCAGA ACTGGCACTG CTGACGCCAT AG
 
Protein sequence
MSDAPETQQV CSLWRHLVEP MEDLLAHRDH LWCVRDDLSR DTGSELGRVI SAIEKHHPEV 
LRRDRILKSV WSREQFVALM DTVRRAAPRG SSFRARWRVA EVLDAGNRLG VWKIPLYDIP
APLPPPGRPI FRPDSFERMV QYEPLYARFV DEVSADLEHL DDGQLGWGQI LGSLLLFGGL
VQPAWLDAVP NSLKHAPAHL HWLDIERSLP DNQRPVIRRH YLDPITRSVV MHWRDDGWPE
MPPGKGRSPA FLSRVIGSYL RRLDPSLRIP DNWTELRDFV ETRLALYVPP HLMGYATGFY
NSVSLSPEVI QRIEFPPGQV PSGEGLVTVD SEGVVPVGQS AVGEDSESQG RPIAPPAGPW
IRELGSAIRG GSAADPKRVA AWLERQANGE GSEPDSVPPS VVKMGEWAQR WLFSSRAGAR
PMKPKTAYDR FNALAVPLAG FLGNEDPAAF ECVDDFVEVY TAVLETADTL SKRKRLAAAL
ASFHDFLRNA HGAPDIASAG LFTVRGRQPH AVDANFIEPR AFEWAVRWLD YRYAEDPELR
ESLVLIACLG YFAGLRRSEA IGLLIGDLDG EPAWDCVVRP NRNRGLKSSS AHRVVPLGVL
LPARYLNRLQ KWWAERRKTM LAEGGDPATV PLFDRPRKKG AENDNLRRFD RDLERVTDAL
IRVTGDGGLR YHHLRHSFAN QLLLALWRHE HRDDALVVER LDPLIGFEDV ATLRTTLLGD
SPVQRRSLRL ISALMGHLTT EITMNHYIHL TDLVWGQAVR GALPPLEFHD VARILGVSLK
HVQRSERTFQ TGNPVLLLER MLDRHLGSPV PADVDAEEPK QLLPPRDPHA ALLSLTDHLN
RTSQDDSKVV HFVGPWQGPT LDVLRAWLSD VPEHFRRPPT GRHERIIAPP RTQQGLGLSK
EAVQLLLSVK GAWGKEERAR LLRIFLSGKL ASGPLDVVLA TLPALDLWVR FLREIQLDDA
FEYMHFSGKG GGRETPMGQY QYWAQRAPVA LESGDANPEP FAEHLPEKRR GVVVARHRRS
GEAGRHRWVY GVCWALVMLK ANDEHPELAL LTP