Gene TK90_1511 details

Gene Information

Locus tag: TK90_1511
Symbol:
ID: 8807277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. K90mix
Kingdom: Bacteria
Replicon accession: NC_013889
Strand:
Start bp: 1608713
End bp: 1610101
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_003460751
Protein GI: 289208685
COG category:
COG ID:
TIGRFAM ID:
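The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive start and end coordinates and a single terminal stop codon that is not counted in the protein length; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(1608713, 1610101)
residues = protein_length_aa(length)
print(length, residues)
```

Under these assumptions the computed values agree with the reported gene length of 1389 bp and protein length of 462 aa.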


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.641455
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.00827535
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAACAGC GTCTGGTATT CGATCCTGAA GTCTTGCGTC GCTATGACGT CAGCGGCCCT 
CGCTACACGT CGTACCCCAC GGCGCCGCAG TTCCACGAGG CGTTCGACGC ACAGGCCTAT
GCGCGGGTGG CGCGGGCGAC GAATGTCAAA GGGGCGGCGC GACCGCTCTC GCTCTATGTC
CACGTGCCCT TCTGCGACAC GGTGTGTTTC TACTGCGCCT GCAACAAGGT GATTACCGGC
AATTATCGGC GGGCGACGAG CTATCTCGAC TCGCTGGAAC AGGAGATTGC CCTGCAGGGC
GAGTGGTTCG ATCGCGATCG GCCGGTGCGC CAGCTGCACT TCGGGGGCGG GACGCCGACG
TATCTGTCCG ACGAGGATCT GACCCGGGTC ATGAAGGCCC TGGGTCGACA TTTCACGCTG
GAGACGGGGC CGGAGCGGGA ATTCTCGATC GAGATCGATC CGCGTGCCGT GCGTCCCGGT
ACCCTGCCGC TTCTGGCATC GCTGGGCTTC AACCGCATCA GTGTGGGGGT GCAGGATGTC
GATCCCGCCG TGCAGAAGGC CGTCAACCGC ATCCAGCCAT TCGAGGTCAC CGAGCAGGCG
GTCCGCCAGG CGCGCGAGCA CGGCTTTGTC TCCACCAACC TGGACCTGAT CTACGGGCTG
CCCCTGCAGA CGGTGGACAC ATTCTCCGCG ACACTGGACC GCGTGCTGGA ACTCCGCCCG
GAGCGGCTGG CCGTATACAA CTACGCGCAC CTGCCGGAGC TGTTCAAGAC GCAGCGCCAG
ATCCGCGACG GCGACCTGCC GCCGGCCACG GAGAAGCTTG CGGTCCTCGA GATGACCATC
CAGCGCCTGA CCGACGCCGG GTATGTGTAC ATCGGCATGG ATCACTTTGC GCTGCCCGAT
GACGAACTCG CGATCGCCCA GCGTGCGGGC ACCCTGCAAC GAAATTTCCA GGGCTACTCG
ACGCGGGCGG AGTACGACCT CGTGGCACTC GGACCCACCG CCATCGGCAA GATCGGGGAC
AGCTACAGTC AGAACCTGCG CGAGGTGGAT GCCTATCAGG AACGGCTGGC GGCGGGCCAG
TTGCCCGTTT TCCGCGGCCT GGAACTGAGC GAGGACGACC GTCTGCGGCG TGCCGTGATC
AGCGAGCTGA TGTGCCACTC GAGGATCGAC TTCGGCGGCA TCGAGGCCAC ATTCGGCATC
GACTTCCGCG AGACCTTCGC CGATGCCCTG AACCGCCTGG CCGAGATGGA GGCCGATGGC
CTCGTCACCA TCGGCCGCGA CAGCCTGGAG GTCCAGCCAC GCGGCCGACT CCTGCTGCGC
AACATCGCCA TGGCCTTCGA CGCCTACCTC AACCGCGAGG AACGCCGTCG CTACTCGAAG
GTCGTCTAA
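The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be normalized before any calculation. The following is a minimal sketch of how the whitespace could be stripped and a GC fraction computed; the helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGGAACAGC GTCTGGTATT CGATCCTGAA GTCTTGCGTC GCTATGACGT CAGCGGCCCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to the full 1389 bp sequence would allow the reported GC content of 65% to be checked.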
 
Protein sequence
MEQRLVFDPE VLRRYDVSGP RYTSYPTAPQ FHEAFDAQAY ARVARATNVK GAARPLSLYV 
HVPFCDTVCF YCACNKVITG NYRRATSYLD SLEQEIALQG EWFDRDRPVR QLHFGGGTPT
YLSDEDLTRV MKALGRHFTL ETGPEREFSI EIDPRAVRPG TLPLLASLGF NRISVGVQDV
DPAVQKAVNR IQPFEVTEQA VRQAREHGFV STNLDLIYGL PLQTVDTFSA TLDRVLELRP
ERLAVYNYAH LPELFKTQRQ IRDGDLPPAT EKLAVLEMTI QRLTDAGYVY IGMDHFALPD
DELAIAQRAG TLQRNFQGYS TRAEYDLVAL GPTAIGKIGD SYSQNLREVD AYQERLAAGQ
LPVFRGLELS EDDRLRRAVI SELMCHSRID FGGIEATFGI DFRETFADAL NRLAEMEADG
LVTIGRDSLE VQPRGRLLLR NIAMAFDAYL NREERRRYSK VV