Gene NATL1_20251 details

Gene Information

Locus tag: NATL1_20251
Symbol:
ID: 4779683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1669758
End bp: 1671143
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 38%
IMG OID: 640085318
Product: Signal transduction histidine kinase
Protein accession: YP_001015845
Protein GI: 124026730
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
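
The coordinate and length fields in this record agree with one another, which a short arithmetic check can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
print(cds_lengths(1669758, 1671143))  # (1386, 461)
```

The result matches the listed gene length of 1386 bp and protein length of 461 aa.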


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.617847
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACGC CTCTAGTAAC TATTAAAGCC TTACAAAGAA GAATGGCTGA GGGTGTTCCT 
CATGCCACAA GCAGTGAATC CGCAGTAAGG AGAATGTGGT GGGCAGCTCT TGATACTCTT
CAGTCAGACA TTTTGTTGCC AATGAACCTT ACAAGAGGTT TGTGGTTATC TGCACCGTTG
CCTGCACTCT ACGAACCTAA ATTACTCAAG AAATTTCAAG GATGGGTGTG GGCACCAAAG
GACTTGTTAA ATATTTCAAA CCCTTCTATG GGGATGCTGC CTCCAAGTAA ATCGGCTTCG
ATGGATTTCC ACAATGATTC TGCAAGCTAT GAGCGCTTGT CTCTTCTTGA GGAAGATGGA
AATGATCCAT TACTAATAGT TATTACTCCT GAAATTCAAA TAGCTTTGGC TCTGGAAGGT
AAATCGAATG AGAGAAAATT ATTAATGCGA AGTGATCCCG AGACTCTAAG TGATCTTTTG
ACACTACTTG ATAATAGATT GAATACAGAA AATGTTGAAC AAGCAAATAA TCTGCGAAAT
GCACTTGGAG AGATGGGACA GCTAAAAACA AATGATGATT TATCTAAAGT ATTTTGGCCT
CTATTATCTC AACGTCTAGC AGACATTGCA CCAAGTCTAA ATATTCAAAC TTTGCCAGAC
AATTTAATTA ATGATCACAA ATCTAGTTCA AAAAATAGTG AAATTTCCTT GCTAGAGGCA
TTAACTCATG AAATCAGAAC TCCGTTGGCG ACAATAAGAA CCCTAATAAG ATCTCTTTTA
AGAAAGCAAG ATTTACCTCA AGTAGTTGAA ACGCGTTTGA AGCAAATAGA TATTGAATGT
ACGGAACAAA TTGATCGTTT TGGTTTGATT TTTAATGCAG TGGAGCTAGA GAGAAGCAAG
CCTAAACAAA CAAATTTAGC TTTAACCGAT TTAGGGCAAA TGCTCACAAT GCTTTCTCCC
GTGTGGAGAA GTCAGTTAGA GCGAAAAGGC TTGACACTTA TTCTTGATAT CACACAGGAT
TTACCAAAAG TTTTGAGTGA CTCAGAAGGA CTTGAATTAA TGTTGACAGG TCTTATTGAC
AGAAACAGTC GTGGATTACA AGTGGGTGGA GAATTAACTT TGAAATTAAG GCCAGCTGGA
CAGAGACTAA AGCTTCAGAT TTTAACTCAT CTTACAGCTA CTACTCATTC TGGATTATCA
GAAAGTGTTT CCAATGAAGA AATTGGACCA GTACTCAGTT GGAATCCAGC TACAGGTAAT
TTGCAACTTA GTCAGGCTGC AACACAAAGA CTTTTGAAAA GCCTTGGTGG ACGTCTTACA
AATAGGCGAG ACAGTGGAAT GACGATATTT TTTCCTGTTT CTGAATCAAA AGAACTTGAT
CTTTAA
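
A GC content figure for a sequence printed in this layout can be recomputed directly from the text. The sketch below is one minimal way to do so, assuming the sequence contains only the letters A, C, G and T plus the spaces and line breaks used for ten-base blocks. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First two ten-base blocks of the gene sequence above.
print(gc_percent("GTGAGCACGC CTCTAGTAAC"))  # 55.0
```

Passing the whole gene sequence to the same function gives the value for the full CDS, which can then be compared with the rounded figure in the Gene Information section.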
 
Protein sequence
MSTPLVTIKA LQRRMAEGVP HATSSESAVR RMWWAALDTL QSDILLPMNL TRGLWLSAPL 
PALYEPKLLK KFQGWVWAPK DLLNISNPSM GMLPPSKSAS MDFHNDSASY ERLSLLEEDG
NDPLLIVITP EIQIALALEG KSNERKLLMR SDPETLSDLL TLLDNRLNTE NVEQANNLRN
ALGEMGQLKT NDDLSKVFWP LLSQRLADIA PSLNIQTLPD NLINDHKSSS KNSEISLLEA
LTHEIRTPLA TIRTLIRSLL RKQDLPQVVE TRLKQIDIEC TEQIDRFGLI FNAVELERSK
PKQTNLALTD LGQMLTMLSP VWRSQLERKG LTLILDITQD LPKVLSDSEG LELMLTGLID
RNSRGLQVGG ELTLKLRPAG QRLKLQILTH LTATTHSGLS ESVSNEEIGP VLSWNPATGN
LQLSQAATQR LLKSLGGRLT NRRDSGMTIF FPVSESKELD L
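
The protein sequence follows from the gene sequence under translation table 11, in which the GTG initiation codon is read as methionine. The sketch below is a minimal, dependency-free illustration of that step, not the database's own pipeline. It assumes unambiguous A/C/G/T input and uses the codon assignments and alternative start codons listed for NCBI table 11. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of table 11; at position 1 they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGAGCACGC CTCTAGTAAC TATTAAAGCC TTACAAAGAA GAATGGCTGA GGGTGTTCCT"
print(translate_cds(first_line))  # MSTPLVTIKALQRRMAEGVP
```

The output corresponds to the first 20 residues of the protein sequence listed above.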