Gene NATL1_20971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20971 
Symbol 
ID4781237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1741129 
End bp1742310 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content29% 
IMG OID640085393 
Producthypothetical protein 
Protein accessionYP_001015917 
Protein GI124026802 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.468089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTACGT TTAATGATAT AAAAAGTTCA GAGCTTATAA TACACACAAG AAAAGTTCTT 
CTTAATAACG AAGGGGAGAC TATTAGTTAC TATATAGATA AAATAGAAGG AACGCAATAT
CTAGATAAAT ATTATATAAA TTCTGGAATA TCGTACAAAG AGAGTAATTA TATAACCCTT
GATTCACGTT TACATTCTAT TGAAGAAAAG AGTTTTTTAC GATCAGTATT TAGGAGATTA
GATGAAGAAT TAGATCTTGA TTTTTTTGAG ATGTCTCATA ATAATGGATC AGATATTGAT
ATTTTTCATG TAAATAGTTC TTCAATTTTT GATACAAATA CTATAGGCCA AGCAATAAAA
CAAGAGCATC AATCAGGAGC ATGGTGGGAA TTATTTTGGA AAGACAGTGA CGAATTGAAA
AAATTTGGAT CTTTAGAAAA AAATACAATC ATTCATGAAA TCGGTCACGC ATTAGGTTTA
GCTCACCCTT TCAATGATCC TTTTAACAAA AATTACACGA CTCAAGACAC AATAATGTCT
TACAATAGAG GTCCATCTGG ATGGAATGAA TGGTTTTCTA GTATCGACTT GCTTGCTCTA
AAAAGTATTT GGAAAAGAGA AGATGATTTA GGAATAATAG AATATGAAAA CCCAAGTAAC
AGTTACAAGT TTATTCGTGA AAATAATGAT TCATTATTTA TAAAAAGTGA GATAGGTAAT
GAGTTGATTG ATGGCATACA AAATTTACAT TTCAGTGATC AAATTCTCAA CGTTAATGAA
GACATACTCA GTGTATTCAA TGAACTCAAA GGAATTGACC ATATTACAGG ACAAATATAT
AGACTATATA ATTCTGCTTT TGCAAGATTC CCTGATATAA ACGGTTTCAG ATATTGGATA
GAAATGAATG AATCTGAAAA TAATACATAC TATCAAACAT CTGCTTCATT TATTAATTCA
GCTGAATTTA AGAAATTGTA TTTTAATGAT CAATCGAACG AAGCATATAT ATACTCACTT
TACAACAATA TTTTTAAGAG AGAGCCTGAT ACTGATGGCT ATGAATATTG GCTTGGACGA
ATTGAAGGTA ACCACGAAAA TAAAAATGAT TTATTAATTG GATTTGCAGA ATCCATGGAA
AGCAAAGAGC TATTTATGAA AGAAACATCT TTAAAATTTT AA
 
Protein sequence
MLTFNDIKSS ELIIHTRKVL LNNEGETISY YIDKIEGTQY LDKYYINSGI SYKESNYITL 
DSRLHSIEEK SFLRSVFRRL DEELDLDFFE MSHNNGSDID IFHVNSSSIF DTNTIGQAIK
QEHQSGAWWE LFWKDSDELK KFGSLEKNTI IHEIGHALGL AHPFNDPFNK NYTTQDTIMS
YNRGPSGWNE WFSSIDLLAL KSIWKREDDL GIIEYENPSN SYKFIRENND SLFIKSEIGN
ELIDGIQNLH FSDQILNVNE DILSVFNELK GIDHITGQIY RLYNSAFARF PDINGFRYWI
EMNESENNTY YQTSASFINS AEFKKLYFND QSNEAYIYSL YNNIFKREPD TDGYEYWLGR
IEGNHENKND LLIGFAESME SKELFMKETS LKF