Gene NATL1_00101 details

Gene Information

Locus tag: NATL1_00101
Symbol: rsbU
ID: 4779737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 13337
End bp: 14734
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 37%
IMG OID: 640083273
Product: protein phosphatase 2C domain-containing protein
Protein accession: YP_001013839
Protein GI: 124024723
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID:
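
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 13337
end_bp = 14734
gene_length_bp = 1398
protein_length_aa = 465

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and is not translated.
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("leftover bases:", remainder)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.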


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGAAA ATTCTCCAAT TTCAGAACAA CAAAATCAAC ATCCTACCAA CCTCTGCTCA 
AGTGCTGCTT CCAAGTCTCT AAGCGATTTG GTTCATAGCC TTTCTCAAGA GCAAGTTGTT
AATCAAGATT TATTGCTTTC ATTGAGCTTT GCTTTGCGAA GTTTTACTAA TTTACAGCGT
TTTCTTGAAC TTATTCCACT GCTTGTCACT CAATTAGTTG GCGTTAAAGG ATCATTGTTA
ATTCCATTTC AAGATAATGG AAGCCTTTGG CGGGAACAGT TACAAATGGT CCCTATAGAT
GAAGATCAGG AACTCTTTAG AAAATTATTT CTTTTAGAGA AAGGGAAGAA AACGGGTTTT
GGCATGGAGG AAAAGAATAT TGAAATGTTA GATGGTTTAG TTCAAAGACA TCTTGATTCT
TCCAATGTAA TTGCTACATC GATAGTTTCT CGAGGAAGAC AGCGAGGTCG ATTATATGCG
TTTGATAAAA AAGAGATTGT TTTTGGTAGT AATGTTCATC GCAAACATAT TCAAATAGTT
GCTGATCTTG CTGGAGTAGC TATAGAAAAT GATGCAATTT TTCAGGTGAT TCGTAATCAT
GAAAAAGTTG ATAGACAAAT AAGTATTGGT GCTGAAATCC AATCACAATT ATTACCTGAT
CAATGTCCAG TAATTGAAGG AGTTGAATTG GCTGCTTGCT GTAGGCCAGC TTTTCAAGTA
GGTGGCGATT ATTACGATTT TATGCCCACT CGCTCAGATT TAAATGAAAC AGCAAAAGCT
AGTGGGCGTT GGGCTTTTGT GATTGGTGAT GTTATGGGTA AAGGTGTTCC TGCTGGATTG
TTGATGACTA TGTTAAGAGG AATGCTTAGA GCGGAAGTTT TAACAGGACT ACCCCCTGAT
TCTATTTTGC ATGACCTGAA TCAATTAGCT CTTGAAGATC TGACTCAATC ACATAGGTTT
GTAACTTTGT TTTATTCTGA CTTTGATGCT AAATCAAGAA AATTACGTTT TGCTAATGCA
GCACATAACC CTCCTTTGCT TTGGAGCTCC AAGACAAAAT CAATTAATAG ATTAGATACC
CCTGGTTTGT TAATAGGTCT TCAACCTGAA GCTGAATATG GATGTGGTGA GATATTTCTG
CAGCCTGGCG ATGTCCTTCT TTACTACACC GATGGAGTTA CTGAGGCACC CGGGATATCA
GGTGAACGTT TTGATGAAAA TCGTTTAATA ACTTTTTTAG ATAAATTTGC TAAGGAAGGT
TTGGGAGCCA AACAAATATT AAATAAACTT TTTGAAAGAT TAGATGGTTT TGTTGGTGTT
AGTGATCATC ATCTTGAAGA TGATGCATCA ATGGTGGTTT TAAAAGTCAA AGAATCAACG
GTTCCTGAAT TAAATTAG
 
Protein sequence
MKENSPISEQ QNQHPTNLCS SAASKSLSDL VHSLSQEQVV NQDLLLSLSF ALRSFTNLQR 
FLELIPLLVT QLVGVKGSLL IPFQDNGSLW REQLQMVPID EDQELFRKLF LLEKGKKTGF
GMEEKNIEML DGLVQRHLDS SNVIATSIVS RGRQRGRLYA FDKKEIVFGS NVHRKHIQIV
ADLAGVAIEN DAIFQVIRNH EKVDRQISIG AEIQSQLLPD QCPVIEGVEL AACCRPAFQV
GGDYYDFMPT RSDLNETAKA SGRWAFVIGD VMGKGVPAGL LMTMLRGMLR AEVLTGLPPD
SILHDLNQLA LEDLTQSHRF VTLFYSDFDA KSRKLRFANA AHNPPLLWSS KTKSINRLDT
PGLLIGLQPE AEYGCGEIFL QPGDVLLYYT DGVTEAPGIS GERFDENRLI TFLDKFAKEG
LGAKQILNKL FERLDGFVGV SDHHLEDDAS MVVLKVKEST VPELN
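
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of that cleanup, followed by a GC fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here, so the result is not the whole-gene GC content listed in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = (
    "GTGAAAGAAA ATTCTCCAAT TTCAGAACAA CAAAATCAAC ATCCTACCAA CCTCTGCTCA"
)

cleaned = clean_sequence(first_line)
print(len(cleaned), "bases")
print("GC fraction of this fragment:", round(gc_fraction(cleaned), 3))
```

To apply the same functions to the full gene, paste all lines of the gene sequence into one string; `clean_sequence` also removes the line breaks.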