Gene NATL1_03481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03481 
Symbol 
ID4781140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp321778 
End bp322899 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content40% 
IMG OID640083615 
Producttwo-component sensor histidine kinase 
Protein accessionYP_001014177 
Protein GI124025061 
COG category[T] Signal transduction mechanisms 
COG ID[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.169774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCAA CGCAGTATTC AGAAAGGTTT TTGAATTTTG TTCAACAACA GTTGATGAGC 
TTTCAAGCTG ATCAAGAACT CGAGCATGTT GTTGTTTATG TAGCTAGATC TGGAGAAAGT
GGATCCCCTA CTTTGGAAGT TGTAGGCCAG TGGCCGAAGT CAGAGAAATT TTTACAGCCA
GTAGAAACTG ATACTGCCCT TCGCACGCCT TCTTCTAATA GAAGATGGTA CCCATTGCAG
GAGGGGTCAA TATTGTTAGG CGTTATACGC GCAGAGAGAT TCGCTACTGA AGAAGAATGG
CGAGAGTCTC TTGATCAACG CCTGCAATCA ATGTCAATTT TGATGGCTAA CTCTCTCGCT
TCTGAACTTG ATAGAAAAAG ATTATTAGAT CAATTAGATG ATCAAAAGGA GCAGATATCA
TTAATGGTTC ATCAGCTCAG AAATCCACTA GCTGCTCTTG GTACTTATGC AAAACTTCTT
TTGAGGAAAA TAGGCCCTGA AAGTGAAAAT GAAAACCTTG TAAAAGGTCT AATGAACGAA
CAAGCACAAG TTAATAAATA TCTTTCTGCG CTCGATCAAC TTAGTCAGGT AAAACTACCC
CAAGCTGATG ATGGATCAAA CAGATTGCTT TTGCCTCCAC TTTTGCCAAG TGAGACTTGC
ATCAGTGTAA AGAGCTTAAT AGAGCCATTG ATTGAGAGAG CTAAGGCTAG AGCTATTTTG
CAAGGCAGGA AATGGTATGG TCCTTCAAAA TGGCCATTAT GGATGGAAGC AAAATCAATT
TCAGAAGGAG TTATTGCTGA AATAGTTGCA AATCTTTTGG AGAACGCATT TCGTTATAGC
CCTCCTAAAG CCTCTATTGG TATTGAAGTA ATCGTAGAGG GGATATGTGT TTGGGATGAG
GGCACCCCAA TAAAGGAGGA AGAAAGAGAA AAGATTTTTC AGAAGGGTTT TAGGGGAGAA
AGTGGTTCAA AAATGTCTGG AAGTGGAATA GGTCTTGCTC TTGCAAGAGA TTTGGCTAGA
CAACTAGGTG GAGATTTACA GTTATTAGTT GATCCAAGTC AATTTAAAAA CTCTTTGCCT
GAATCAGGGA ATGCTTTCGT TTTTAATTTG GAATCAAAAT GA
 
Protein sequence
MTSTQYSERF LNFVQQQLMS FQADQELEHV VVYVARSGES GSPTLEVVGQ WPKSEKFLQP 
VETDTALRTP SSNRRWYPLQ EGSILLGVIR AERFATEEEW RESLDQRLQS MSILMANSLA
SELDRKRLLD QLDDQKEQIS LMVHQLRNPL AALGTYAKLL LRKIGPESEN ENLVKGLMNE
QAQVNKYLSA LDQLSQVKLP QADDGSNRLL LPPLLPSETC ISVKSLIEPL IERAKARAIL
QGRKWYGPSK WPLWMEAKSI SEGVIAEIVA NLLENAFRYS PPKASIGIEV IVEGICVWDE
GTPIKEEERE KIFQKGFRGE SGSKMSGSGI GLALARDLAR QLGGDLQLLV DPSQFKNSLP
ESGNAFVFNL ESK