Gene Acid345_1007 details


Gene Information

Locus tag: Acid345_1007
Symbol:
ID: 4069772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1271077
End bp: 1272033
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 58%
IMG OID: 637983014
Product: putative sulfite oxidase subunit YedY
Protein accession: YP_590084
Protein GI: 94968036
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
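
The coordinate and length fields above are related by simple arithmetic: with 1-based inclusive coordinates, the gene length is end minus start plus one, and the protein length is the number of codons minus the stop codon. The following is a minimal sketch of that consistency check. The function name `check_record` and its return keys are hypothetical, and the inclusive-coordinate convention is an assumption that happens to agree with the values listed.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of a CDS record (hypothetical helper).

    Assumes 1-based, inclusive coordinates and a single terminal stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons = span // 3                    # whole codons, including the stop
    return {
        "span_bp": span,
        "length_matches": span == gene_length_bp,
        "in_frame": span % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record above.
result = check_record(1271077, 1272033, 957, 318)
print(result)
```

For this record the span is 957 bp, which is 319 codons, so the 318 aa protein length is consistent with one stop codon.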


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0460692
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0531219
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGATCA AGAAGCAGGG TGAGATTCCG TCGTCGGAAA TTACGGACAA AAAGGTGTAC 
CTGAACCGTC GTGCGTTTAT CGGCGGGGCG GCTGCGGCTG GGGCGGCGAT TGCGGTGGGA
TTCAAGGCGG CGGGGCTATT CGATCCGGCG CTGCACGCGA GCGCGAATGC GAAGTTGCAG
TTCAAGCCGA GCAGCTTCAG CACGAACGAG AAGCAGACGC CGCTGAACGA CGTGACCCAC
TACAACAACT ATTACGAGTT CGGCACCGAC AAAACCGATC CGGCAGACGA GGCCAAAAAT
TTCAAACCAA CGCCGTGGAA GGTGAAGGTG GAAGGCCTGG TCAAGAAGGC GCAGACCTTC
GACATTGACA CGTTGCTGAA GATCCCGCTG GAGGAGCGCG TGTATCGCAT GCGCTGCGTC
GAGGGATGGT CGATGGTGAT TCCGTGGATC GGATTTCCGT TGTCGGCGCT TTTGAACCAA
GTGGAAGTGC AGCCGAAGGC GAAGTTCGTG GAGTTTACCT CGCTGCTCGA TCCGAATCGC
ATGCCGGGGC AGCGGAGGGC GGTGCTGGAA TGGCCGTATG TGGAGGGGTT GCGGCTGGAT
GAGGCGATGC ATCCGCTGAC GACGATGGTG GTAGGCCTGT ATGGCGAGAC GCTTCCGAAC
CAGGATGGCG CGCCGTTGCG ATTAGTGGTG CCGTGGAAGT ATGGGTTCAA GGGGATCAAG
GCGATCGTGA ACATCAAGCT GGTGGAGAAA CAGCCGACCT CGACGTGGAC GCAGGCGGCG
TCAAACGAAT ATGGTTTCTA CTCCAATGTG AATCCGAACG TGGACCATCC GCGATGGAGC
CAGGCGAAGG AGCGGAGGAT CGGGGAGTTT TTCAAGCGTC CGACGCTGAT GTTTAACGGG
TACGGCGACC AGGTGGCGAG TTTGTATTCG GGCATGGATT TGAAGAAAAA CTTCTAA
 
Protein sequence
MLIKKQGEIP SSEITDKKVY LNRRAFIGGA AAAGAAIAVG FKAAGLFDPA LHASANAKLQ 
FKPSSFSTNE KQTPLNDVTH YNNYYEFGTD KTDPADEAKN FKPTPWKVKV EGLVKKAQTF
DIDTLLKIPL EERVYRMRCV EGWSMVIPWI GFPLSALLNQ VEVQPKAKFV EFTSLLDPNR
MPGQRRAVLE WPYVEGLRLD EAMHPLTTMV VGLYGETLPN QDGAPLRLVV PWKYGFKGIK
AIVNIKLVEK QPTSTWTQAA SNEYGFYSNV NPNVDHPRWS QAKERRIGEF FKRPTLMFNG
YGDQVASLYS GMDLKKNF
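
The gene and protein sequences above are linked by translation table 11, and the GC content field can be derived from the gene sequence. The sketch below shows one way to reproduce both from the 10-base blocks as printed. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the amino acid assignments of the NCBI bacterial code; treating the first codon as a plain ATG is an assumption that holds for this gene but not for genes with alternative start codons.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGTTGATCA AGAAGCAGGG"))  # MLIKKQ
```

Passing the full gene sequence to `translate` should yield the listed protein sequence once its spaces are removed, and `gc_fraction` on the same input can be compared against the GC content field.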