Gene NATL1_01391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01391 
Symbol 
ID4779597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp136586 
End bp137668 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content38% 
IMG OID640083403 
Producthypothetical protein 
Protein accessionYP_001013968 
Protein GI124024852 
COG category[S] Function unknown 
COG ID[COG3330] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTTTTG ATCAAGAATC CCTGTCACGT TTAACACTTC GTCAGCTTCG TACAAAAGCT 
AGTGAATTAG GTATTCCTCT TTATAGTAGA AAATCTAAGG CCGATTTAGT TAAGGGTGTA
TTGCTGTACG AAGAAAAAAA GGAATTAGAA AAACAGTTGA TAAACAATAA AGTCCAACCA
TCAAGCGAAA CTACATATCA AAATTCATCA GAGACCAAAG TCGTTTTTCT TCCTCGTGAT
CCCGAGTGGG CATATATATT TTGGGAGATA TCAGATTCTG ATCGTTCTAA TGCTCAAAAA
GAAGGTGCTA TTAGGCTTTG TTTGCGTTTA GCTGATGTCA CCAATAAAAA TAATGGAGAG
ACTAATCCTG GAACTCTTCA AGAAATTGTT GTTGATAGTC ACAGTACGGA GTGGTACTTA
CCTATTCCTT TAGCTGGAAG AGACTACAAG GTTGAACTCG GTTATCGAAT TGGTCATAAA
TGGATGTCAC TCGCTCATTC ATCTTCAGCC AAAGTACCTT CACTTCATCC AAGTGAGCAA
ATTCTTGATC AATTTGTTCC TTTTAGTCTA GAAGCCCCAG TTACTACTTC CTCTGATTCT
AAGATAGAAA GTTTTGCATC AGAACAACCA GACAGTGGTT TGCATGAGCG TTTATATCAA
TCAGCGACCA CAAAATTTAG AACTAGAAGA GTTGGTTCAG AAGAATTCCA AGAGGGTTTC
CCAGGAGATC TAAATTCAAA TAATGAATCT GGTAGTGGGC TTTGGGCTAG TGGCTTGAAT
GAATCTGGTA TTGGTGGGGT TCCTCAAGCT CGTTCTTTTT GGTTGGTTGC TGATGCGGAA
TTAATTGTGT ATGGAGCTAC TGATCCCTCA GCTAAATTGT TTATCGAAGA TGAAGAGGTC
CCACTAGGAA ATGATGGAAC TTTTAGATTG CAAGTCCCAT TCAGAGACGG TATTCAGAAC
TATTCAATTA AAGCTATTGA TAAAGATGGT GTTGATTCAA GGAACATAAC AATGAAATTC
GAAAGAGTTA CTCCAGTTGA TAACACTAAC CCAAATTCCA AAGCTGAATC AGAATGGTTT
TAA
 
Protein sequence
MTFDQESLSR LTLRQLRTKA SELGIPLYSR KSKADLVKGV LLYEEKKELE KQLINNKVQP 
SSETTYQNSS ETKVVFLPRD PEWAYIFWEI SDSDRSNAQK EGAIRLCLRL ADVTNKNNGE
TNPGTLQEIV VDSHSTEWYL PIPLAGRDYK VELGYRIGHK WMSLAHSSSA KVPSLHPSEQ
ILDQFVPFSL EAPVTTSSDS KIESFASEQP DSGLHERLYQ SATTKFRTRR VGSEEFQEGF
PGDLNSNNES GSGLWASGLN ESGIGGVPQA RSFWLVADAE LIVYGATDPS AKLFIEDEEV
PLGNDGTFRL QVPFRDGIQN YSIKAIDKDG VDSRNITMKF ERVTPVDNTN PNSKAESEWF