Gene A9601_00101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_00101 
SymbolrsbU 
ID4716692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp13122 
End bp14465 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content27% 
IMG OID640077707 
Productprotein phosphatase 2C domain-containing protein 
Protein accessionYP_001008405 
Protein GI123967547 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACAAACT ATCAAAAAGA AAAAATATTT TCGAACAAAT TTATTAAAAA TTTTTTAGAA 
AACGAATCTA CAGAAATTTT AAAAAATAAA TATAAATTTG CTGAAATTGC ATCTTCACTA
GCATATTATT TAAAATCGTT TTCCAACATA AATAAATTAT TAGATTATAT TTCTTTAATT
TTTAAACATA TTTTTTCTGA GAATATAATT TTAATTATTC CTTTAAATTA TGAGGGTGAT
ATATGGAATG AAAATATAAA AATTTCTGTT AATGATAAAT ATTTAACAAT TCAAAAAGAA
ATCAATAAAT TTTTGAATCA ATTTCATTTT TCAAAAAATT TTAAAATAAA AGAAATTTTA
ACTTTTGAAA ATGCTTTAAA AAATAATTTT AAAGAATATA AAATTGAAAC AAAAAAAATA
ATATCTAGAG GTAAATGTAG AGGATTTATT TATATTTTTA GCAAAGATAT TTATATACAG
TCGATTACTG AAGATAGTAA TTTTAATTTT ATTGAAAATT GTCTAGCTGT TGGATTAGAA
AATCACTATT TATTAAAAAC AAAGAAAAAG CATGAAAACG TAGATAGAGA AATCTCCACT
GGTGCTGAAA TTCAATCTCA ATTACTTCCG GATTATTGCC CAATTATCCA TGGTATAGAT
TTAGCAGCTC ATTGTAGACC AGCTCTTCAG CTCGGAGGGG ATTACTATGA TTTTATGTGC
TTGAAGACGA ATATCTCTGA AAAAAGAAAA GAAAAATCAA GATGGGCTTT TGTTATAGGT
GATGTCATGG GTAAAGGGAT TCCGGCTGGC CTTTTAATGA CGATGTTGAG AGGAATGCTA
CGCGCTGAGG TTCTTACAGG TCTGCCTCCA GATAGAATTT TGCATGATTT GAATCAACTA
GCAATAAATG ATTTAGATCA ATCACATAGA TTTGTGACTT TATTTTACTC AGATTATGAC
CCTAGAACTA GAAAATTGAG ATTCGCTAAT GCAGCACATA ATCCTCCTCT GCTTTGGAAA
AGTTCAGATC AGAAAATTAT TAAATTAGAT GCAGAAGGAT TTGTACTTGG ACTACAAAAA
GATGCAGAAT ACCAATGTGG TGAAATAAAG CTTAATCAAA ATGATTTAGT TCTCTATTAC
ACAGATGGAG TAATAGATAC TTCTAATTCC TTAGGGCAAA GATTTGACGA GGAAAGGTTA
ATTAAAACGC TTACAAAATT TTGCAAGCAA TCATATTCAT CCCAAGAAAT TTTAAATAAA
ATATTTAAAA AGTTAGATGA TTTTACTGGA CAAAATAGAC ACTTGGAAGA TGACGCCTCG
ATGGTTATTT TTCAATTGAA ATAG
 
Protein sequence
MTNYQKEKIF SNKFIKNFLE NESTEILKNK YKFAEIASSL AYYLKSFSNI NKLLDYISLI 
FKHIFSENII LIIPLNYEGD IWNENIKISV NDKYLTIQKE INKFLNQFHF SKNFKIKEIL
TFENALKNNF KEYKIETKKI ISRGKCRGFI YIFSKDIYIQ SITEDSNFNF IENCLAVGLE
NHYLLKTKKK HENVDREIST GAEIQSQLLP DYCPIIHGID LAAHCRPALQ LGGDYYDFMC
LKTNISEKRK EKSRWAFVIG DVMGKGIPAG LLMTMLRGML RAEVLTGLPP DRILHDLNQL
AINDLDQSHR FVTLFYSDYD PRTRKLRFAN AAHNPPLLWK SSDQKIIKLD AEGFVLGLQK
DAEYQCGEIK LNQNDLVLYY TDGVIDTSNS LGQRFDEERL IKTLTKFCKQ SYSSQEILNK
IFKKLDDFTG QNRHLEDDAS MVIFQLK