Gene Acid345_0959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0959 
Symbol 
ID4072947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1217050 
End bp1218195 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content60% 
IMG OID637982966 
Productserine phosphatase 
Protein accessionYP_590036 
Protein GI94967988 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGCG CCGCTCCAGC TCCGCAGAAA CCGCGTCCTC TTCCCGGTAC CGGTTGGTCT 
CGATTCTGGA ACCGTGTGAC CGAGGGCATG GCAATGGACC AGCTCTGGTC GCAGTTTCGC
GCCGATACAC GCGCTGGCTA CCGCTTCTAC TCCAAAGAAA TCAACCAGGA CCGCACTTCG
GGCATGAAGC CCGGCCACAA GTTCTGGTTC ATGGCGGAAC AGTTCTTCTG GGCGGTGATG
GAAAAGCTCT CGCCCGCCCG CCGCGTGCTG TTGCTGGTTG CGCTGGTTTC GTTGCTCTTC
GGACAGATCA CGTGGCACAC CACCCAGAAC CAGACCATCG TCGTCAATCC GCAGTTCTGG
GGCGGACTGC TGATGTTCCT GCTGCTGATC CTGGAGATCG GCGACCGCGT GATCATGAAG
CGCGATTTGC AGATCGCTAA AGAAATCCAG CAGTGGCTGC TGCCCGCGAC GCCTCCGATT
GTCCCGGGGA TGGAAGTCGC GTTTGTTACG CGGCCGGCGA ACACGGTGGC CGGAGACTTT
TACGATGTCT TCCCGCGCAC CGGCGGAAAC GACCACTGGC TGATCGCCTG CGCTGACGTG
GCGGGAAAGA GCATGCCGGC AGCCCTGTTG ATGGCGACCT TCCAGGCCAG CTTGAAAACT
CTCTCCACCG CCGACATCTC GCTGCCCGAT CTCGCGGCGA GCATGAATAA ATACGCCTGC
ACCAACAGCC AGCAGGGGCG TCGTTTCACC ACGGCATTCC TGGCGGAATA TACGCCGGCG
ACACGGACAT TGACGTACAT CAATGCTGGT CATAACAACC CAATGCTGCG GCGCGCGACG
GGACAGCTCG AGCGGTTGGA TATCGGCGGC ATTCCGCTGG GCATGATGGA AGACGTGACC
TACCCATCAG CGACACTGAC TTTGAATCCG GGCGACTGGC TGGTCATCTT TACCGATGGC
GTAGTCGAGG CCGAGAACGA CCATGCGCAG GAATACGGGG AAGACCGGTT GATCGCAGTC
GTCAACGCGA ACCTGCAGCT AGGCCCGGCG CAGATGCTTT ACCAGATCAT GACGGACGTG
GACCGGTTCG TCGCCATGAC ACCCCAGCAT GACGACATCA CGTGTTTGCT GATCAAAGCG
GTGTAG
 
Protein sequence
MSSAAPAPQK PRPLPGTGWS RFWNRVTEGM AMDQLWSQFR ADTRAGYRFY SKEINQDRTS 
GMKPGHKFWF MAEQFFWAVM EKLSPARRVL LLVALVSLLF GQITWHTTQN QTIVVNPQFW
GGLLMFLLLI LEIGDRVIMK RDLQIAKEIQ QWLLPATPPI VPGMEVAFVT RPANTVAGDF
YDVFPRTGGN DHWLIACADV AGKSMPAALL MATFQASLKT LSTADISLPD LAASMNKYAC
TNSQQGRRFT TAFLAEYTPA TRTLTYINAG HNNPMLRRAT GQLERLDIGG IPLGMMEDVT
YPSATLTLNP GDWLVIFTDG VVEAENDHAQ EYGEDRLIAV VNANLQLGPA QMLYQIMTDV
DRFVAMTPQH DDITCLLIKA V