Gene NATL1_12151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_12151 
Symbol 
ID4779617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1063716 
End bp1065185 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content37% 
IMG OID640084494 
Producthypothetical protein 
Protein accessionYP_001015038 
Protein GI124025922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATA AAGACTTACC ATCGATATGT GGGTATGAAT TTGCAATTAA CGAATTAGTA 
AATAGGGATA GAGATAGTAG ACAATTCAAA GAAAATTTAA ATTTCGAGAA AGTCAATTCA
GGATTTGCAT GTGCCCTCCA TATGCATCAA CCAACAATCC CAGCAGGTGA GAATGGAGAG
CTCATATCAC ATTTGCAATA TATGTTTGAG CACACCTCAG AAGGAGATAA TCACAATGCC
GAACCATTTG CTCAATGCTA TAAACGTCTA GCTGAAATCA TTCCAAGCCT GATAAAAGAT
GGACATGATC CGAAAATAAT GCTTGATTAT TCCGGAAATC TTCTATGGGG ATTTGAACAG
ATGGGTCGAG AGGATATTCT ATCCTCGTTA AAACTACTGA CCTGTGATGA GACAATTTAT
CCACATGTTG AATGGCTAGG AAGCTTTTGG AGTCATGCTG TAGCCTCTTC AACACCACCC
TCTGACTTTA AATTACAGAT AACAGCTTGG CAACATCACT TTTCAGCATT GTTCGGAGAA
GATGCATTAC GTCGAGTAAA TGGATTTTCT CTTCCTGAGA TGCATCTTCC AAACCATCCT
GATGTTCTTT TTCAATTAAT AAAAGCTCTT AAAGAATGTG GGTATCGATG GTTAATGGTT
CAAGAACACA GTGTTCAGAA TATAGATGGC TCAAGTCTCA GAGACGATCA AAAATATATT
CCCAACATGC TCAAAGCTCA AGCAAGTAAT GGAGATACGA TCTCTATACT TTCACTAATA
AAAACTCAAG GGTCAGATAC AAAGCTTGTT GGTCAAATGC AACCTTATTA CGAGGCACTA
GGGCTATGTA AACAAAATTT AGGTCAACAT ATTATTCCGA AGCTGGTTTC TCAAATTGCT
GATGGAGAAA ATGGCGGAGT GATGATGAAC GAATTTCCTC AAGCTTTTAT TCAGGCACAT
AAAAGAATTG GCCCAAAAAC AAATATAAGT CCCACAATTG CTATGAACGG ATCTGAATAC
CTCAACTTCC TAGAGACTTC AAATGTAGAT GAAGATACTT ATCCAGTGAT ACAAGCAATC
GATCAACACA AAATATGGGG GAAAATATCC GGACCAATAA CACCAACAAA ATTTAAAAAA
GCAATTGAAG TATTAAAAGA GGAAGATCAA TCTTTTTCTT TGAGTGGAGC TAGCTGGACT
AACGACTTAA GTTGGGAAGA TGGATACAAT AACGTTTTAG AACCAATTTC AAAACTTAGT
TCATATTTTC ACGAAACATT TGACCATTTA GTAGCTCAAA ATCCATCGCT AACAAAAACG
CATAGTTATC AAAGAGCACT CCTTTACCTT TTGCTATTAG AAACTAGCTG CTTCCGTTAC
TGGGGACAGG GGAAATGGAC TGATTACGCT AAAACGATCT TCGAAAAAGG CGAAGAGGTA
CTTAGAAACA TAGAAATTTC ATCTAACTAA
 
Protein sequence
MKNKDLPSIC GYEFAINELV NRDRDSRQFK ENLNFEKVNS GFACALHMHQ PTIPAGENGE 
LISHLQYMFE HTSEGDNHNA EPFAQCYKRL AEIIPSLIKD GHDPKIMLDY SGNLLWGFEQ
MGREDILSSL KLLTCDETIY PHVEWLGSFW SHAVASSTPP SDFKLQITAW QHHFSALFGE
DALRRVNGFS LPEMHLPNHP DVLFQLIKAL KECGYRWLMV QEHSVQNIDG SSLRDDQKYI
PNMLKAQASN GDTISILSLI KTQGSDTKLV GQMQPYYEAL GLCKQNLGQH IIPKLVSQIA
DGENGGVMMN EFPQAFIQAH KRIGPKTNIS PTIAMNGSEY LNFLETSNVD EDTYPVIQAI
DQHKIWGKIS GPITPTKFKK AIEVLKEEDQ SFSLSGASWT NDLSWEDGYN NVLEPISKLS
SYFHETFDHL VAQNPSLTKT HSYQRALLYL LLLETSCFRY WGQGKWTDYA KTIFEKGEEV
LRNIEISSN