Gene NATL1_05441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_05441 
Symbol 
ID4780428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp491718 
End bp492944 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content31% 
IMG OID640083821 
Producthypothetical protein 
Protein accessionYP_001014371 
Protein GI124025255 
COG category[S] Function unknown 
COG ID[COG1641] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00299] conserved hypothetical protein TIGR00299 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.307892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCTA TTTTTATTGA TTGTAGTCTT GGAATATCTG GAGATATGCT CGCATCAGCT 
TTATTCGATT TAGGTGTTCC GCACTCTATT TTCTTGGATA ACTTAGTAAG TTTAAATATA
GATAAAAATT ATAAACTAAA ATTTAAAGAG GGAGATAGTG AAGGCATAAA AGGTATTGTC
TGTATGAAAA ATGAAATTCA ATTTAAGGAA TTATCTAGAA GTCTTAATGA GATAAAGAAC
TTACTTCTCA ATTCAAGCTT GAATGATTAT GTGAAGAAAA AGTCTATTAA GGTTTTTGAA
ATTCTTGCTG AAGCTGAAGC AGTTGTTCAC GGGAATCAAA TCTCAGATGT TCATTTTCAT
GAACTAGGTT CAATAGATTC CATCCTTGAT ATTGTTAATG TTTGCTCAGC TATAGATTTT
TTAAAGCCAT ACAAAATTTA TTTTTCAAAT CCACCTTCTG GGAAAGGAAT CGTATCCACT
TCACATGGCC CTCTGCCTGT TCCAGTGCCT ACTGTTCTAG AGATAGCGAG GCAAAATGAA
ATCCCATTAA TGGTGCTTGA TGATAAATAT TTTGGTGAAA TAACAACTCC TACTGGCATC
GCATTGATAG CAACTTTTAT AGATAAGTTT GGTCAACCAA GTAATCTAAA TATTCAAAAT
ATTGGTATTG GCTTAGGAAG TAAAAATATA TCTCGACCTA ACTTTTTACG TATTCTGCTA
ATAGATGAAA ATGATGATTA TATGGAAAAT AATAAACCCT CTAATGAAAC TATAATTGCT
CAGGAGGCTT GGATTGATGA TTCTACGCCT GAAGATGTTG CGGTTTTAAT AGATAGATTA
AGGTCTGCAG GTGCCATAGA TGTTATTTGT TATTCGGTCG ATATGAAAAA AAATAGAAAA
GGTATATGTA TACAAGCTAT TGTTTACCCA AAGCATAAAA ATTTACTGCG TGAAGTTTGG
TTTAACTATA GTACAACAAT TGGAATAAGA GAAAATAAGA TTAGCCGCTG GATACTTCCA
AGAAGAACAG TGAGTCATAA AACTAAATTT GGGACAGTTA ATGTTAAACA AGCAATGAGA
CCAAATGGTC TTAATTCAAT AAAAATAGAA CATAAAGACT TGACTCGAAT AACTTTAAAT
ACAGGAATTC CAATAGAAGA GATACGTCAG AAATTAATCA TAGAATTATC AGAATTTTAT
GAAATCGATG ATTGGTCTTT TTTATGA
 
Protein sequence
MKSIFIDCSL GISGDMLASA LFDLGVPHSI FLDNLVSLNI DKNYKLKFKE GDSEGIKGIV 
CMKNEIQFKE LSRSLNEIKN LLLNSSLNDY VKKKSIKVFE ILAEAEAVVH GNQISDVHFH
ELGSIDSILD IVNVCSAIDF LKPYKIYFSN PPSGKGIVST SHGPLPVPVP TVLEIARQNE
IPLMVLDDKY FGEITTPTGI ALIATFIDKF GQPSNLNIQN IGIGLGSKNI SRPNFLRILL
IDENDDYMEN NKPSNETIIA QEAWIDDSTP EDVAVLIDRL RSAGAIDVIC YSVDMKKNRK
GICIQAIVYP KHKNLLREVW FNYSTTIGIR ENKISRWILP RRTVSHKTKF GTVNVKQAMR
PNGLNSIKIE HKDLTRITLN TGIPIEEIRQ KLIIELSEFY EIDDWSFL