Gene TK90_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTK90_0404 
Symbol 
ID8806139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. K90mix 
KingdomBacteria 
Replicon accessionNC_013889 
Strand
Start bp420208 
End bp421338 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content68% 
IMG OID 
Productprotein of unknown function UPF0075 
Protein accessionYP_003459655 
Protein GI289207589 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000929624 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGC GGTTAATTGG ATTGATGTCC GGCACCAGCC GGGACGGCGT GGACGCCGTC 
CTGGTCGAGA TTGAAGGAGA CGGCTCGCTG CATACCGCGG GGCATTGCCA CCTGCCATAC
CCGGATGCGC TGGAGGAGGA TCTCGCAGCC GCCGCGACCG CCGAGGCACT GCGCTTCGAG
CACCTGGGCA CACTGGACGC CCGGGTTGGG CTGTTCCTGT CGCGCGCCGT CCAGCAACTG
CTGGGATCCA CCGGCCTGAA GGCCCAGGAC ATCCTGGCCA TCGGCTCACA CGGGCAAACG
GTCCACCATG CGCCCGGCGC CGACCCCGCG TTCACCTGGC AGATTGGCGA CCCATTTCGA
ATCGCCGAAG CCACCGGCAT CGACGTAATC GCGCACTTTC GCCAACGCGA CCTCGCGGCG
GGTGGCGAGG GGGCTCCGCT GGCCTGCGCG TTCCATGCCG CCTGGCTGGG CCACGCGTCC
GAGACACGCG CGATCCTGAA CCTCGGGGGG ATCGCCAATC TGACCTGGCT GGAGCCGGGC
CAGCCGGTAC GCGGATGCGA CAGCGGACCT GCCAACACCC TGCTGGACGG CTGGGCCCGG
CGGCACCTGG GCCAGCCCTA TGATGCCGAT GGCTCCTGGG CGCGCACGGG ACACGTCGAC
CGGTCACTGC TCGAACAACT CCTAGCGGAC CCGTACTTTC AGCGCCCCGC CCCCAAGAGC
ACGGGGCCCG AGCATTTCTC GCCCCACTGG CTGCGCCAGG TTGGTGGCGA ACGGATCGAT
CGCCTGAACA CAGAGGACGT TCAGGCCACC CTGGTCGAAC TTACGGTGGA GGGCGTGCGG
CTGACGCTCG AATCCCTGCG CACAACCGCA CCCGATCGGG TCATCGTCTG TGGCGGAGGC
GCCCACAACG GCTACCTGAT GGAACGGCTG CAAAGCCAGC TGGCCGGCAG CACCGTCGAG
ACCTCCGAAC GCCACGGGAT ACCCCCTCAG CAGGTGGAGG GCGCCGCCTT TGCCTGGCTG
GCGTATCGCC ACCTGCAACA AGAGGCCGGC AATCTGCCGG AGGTCACAGG TGCCCGTGGG
CCACGCATCC TCGGCTGCCG GATCCCCGGA CGCGCACCTG AATCCACATA A
 
Protein sequence
MTQRLIGLMS GTSRDGVDAV LVEIEGDGSL HTAGHCHLPY PDALEEDLAA AATAEALRFE 
HLGTLDARVG LFLSRAVQQL LGSTGLKAQD ILAIGSHGQT VHHAPGADPA FTWQIGDPFR
IAEATGIDVI AHFRQRDLAA GGEGAPLACA FHAAWLGHAS ETRAILNLGG IANLTWLEPG
QPVRGCDSGP ANTLLDGWAR RHLGQPYDAD GSWARTGHVD RSLLEQLLAD PYFQRPAPKS
TGPEHFSPHW LRQVGGERID RLNTEDVQAT LVELTVEGVR LTLESLRTTA PDRVIVCGGG
AHNGYLMERL QSQLAGSTVE TSERHGIPPQ QVEGAAFAWL AYRHLQQEAG NLPEVTGARG
PRILGCRIPG RAPEST