Gene TK90_0633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTK90_0633 
Symbol 
ID8806382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. K90mix 
KingdomBacteria 
Replicon accessionNC_013889 
Strand
Start bp673560 
End bp674966 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content62% 
IMG OID 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003459884 
Protein GI289207818 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0599243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAC AGACCCCAGG CAATCACAAC CAGAACGGCG AAGGTGCCGG TGCCGGCACC 
ATGAAGCCCG GCTACCACAA TACCGACGGT CAGGGCATCG CCGGTCACGG TTCCTTCTTC
CAGGCGACCG ACCTGTCCAA GGAAGACGCG ATCAAGGCGA CCGACTGGGT GCGCAAGCAC
GTCGACCGCC GTACGGTCGA CCTTGGTGAC CGTATGGACG ACGTCCGCGA GCACATGTAC
GAGCTCGAAA AAGACGGTCA GATCATCATC CACCGCATCG AGGACCAGCA CGAGCCGATG
TCGGTGAAAA CCCTGTTCGG CTGGGACAAG AAGGTCCCGA CCAAGCAGCT CTGGCACCAC
AAGTCCTGTG GCCAGTGCGG GAACATCCCG GGCTACCCGA CCTCGCTGCT GTGGTTCATG
AACAAGTTCG GCTTCGAGCC GGGCAAGGAC TACCTCGACG AGACCGACCA GACCTCCTGC
ACCGCGTGGA ACTACCACGG TTCCGGTATC GGGAACGTCG AGTCGCTGGC CGCGGTGTTC
CTGCGCAACT TCCACCAGGC CTACGTCTCC GGCAAGCAGC ACGGCCATGA GCTGGGTCAC
TTCTACCCGC TGGTCCATTG CGGCACCTCC TTCGGGAACT ACAAGGAGAT CCGCAAGTAT
CTGGTCGAGT CCGCCGAGCT GCGCGAGAAG GTCACCAAGA TCCTCGGCAA GCTGGGTCGC
CTGGTCGACG GCAAGCTGGT CATCCCGGAA GAGGTCATCC ACTACTCCGA GTGGGTGCAC
GTGATGCGCA ACCGCATCGC CTCCGAGCTG CAGACCATCG ACGTCTCCAA TATCCGTACC
ACTGCCCACG TGGCCTGCCA TTACTACAAG ATGGTGCACG AGGACGCGGT GTACGACCCG
TCCGTGCTGG GTGGCAACCG TACCGCGATC ATCACCTCCA CCGCCCAGGC CCTGGGTGCG
CAGGTGATCG ACTACTCCAC GTGGTACGAC TGCTGCGGCT TCGGCTTCCG CCACATCATC
TCCGAGCGCG AGTTCACCCG CTCGTTCACG ATGGATCGCA AGATCCGCGT GGCCCGCGAG
GAAGCCAACG CCGACGTGAT GCTGGCCAAC GACACCGGCT GCGTCACGAC CATGGACAAG
AACCAGTGGA TCGGTAAGGC CCACAACCAG AACTTCCAGA TTCCGATCAT GGCGGAAGTC
CAGTTCGCGG CCCTGGCCTG CGGCGCGGAT CCGTTCAAGA TCGTCCAGCT GCAGTGGCAC
GCATCGCCCT GCGAAGACCT GGTCGAGAAG ATGGGGATCT CCTGGGACGA GGCCAAGAAG
ACCTTCCAGG AATACCTGAA GGAAGTCGAG GCCGGCAATA TCGAATACCT CTACAATCCT
GAGCTCGCTT ACGGGGGCAA AGTCTGA
 
Protein sequence
MSEQTPGNHN QNGEGAGAGT MKPGYHNTDG QGIAGHGSFF QATDLSKEDA IKATDWVRKH 
VDRRTVDLGD RMDDVREHMY ELEKDGQIII HRIEDQHEPM SVKTLFGWDK KVPTKQLWHH
KSCGQCGNIP GYPTSLLWFM NKFGFEPGKD YLDETDQTSC TAWNYHGSGI GNVESLAAVF
LRNFHQAYVS GKQHGHELGH FYPLVHCGTS FGNYKEIRKY LVESAELREK VTKILGKLGR
LVDGKLVIPE EVIHYSEWVH VMRNRIASEL QTIDVSNIRT TAHVACHYYK MVHEDAVYDP
SVLGGNRTAI ITSTAQALGA QVIDYSTWYD CCGFGFRHII SEREFTRSFT MDRKIRVARE
EANADVMLAN DTGCVTTMDK NQWIGKAHNQ NFQIPIMAEV QFAALACGAD PFKIVQLQWH
ASPCEDLVEK MGISWDEAKK TFQEYLKEVE AGNIEYLYNP ELAYGGKV