Gene NATL1_03711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03711 
Symbol 
ID4779465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp344357 
End bp345754 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content40% 
IMG OID640083639 
Producthypothetical protein 
Protein accessionYP_001014200 
Protein GI124025084 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.291486 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACAAAA GAAGAAAAAG AAAATCAAGA AGAGCCACCT TGCGTAAAAG CCTTCTCGCT 
AGCTTGTCTT TTCATGTTTA CTCTCGCTTC AAAAGAGCTA TTAGATGGCT GTTGCCTGGA
TTAGTAGTCA AAAGGTGGAT GATGACATCT GGATTGGGCT TGTTAATTGC TTTGATTGGA
GCCTCTATTT GGGCTGACTT GCGTCCAATT TATTGGGTAG TTGAAATACT TTTTTGGTTT
TTAGGATTCA TAACTACTTT TTTACCTAGA ACTATTTCTG GACCTTTAGT CTTTTTTATA
GGGATATCTC TTTTGATTTG GGGTCAAGGG AGAAGTTTTG AATCTATTAG GCAGGCTTTA
GGTTCAAAAA AAGATACTTT TTTGGTTGAT GCTTTAAGAG CTAAGCAAAA ATTAAATAGA
GGTCCAAACA TCGTTGCTAT TGGAGGAGGA ACAGGTTTAT CTTCATTATT AAAAGGCTTA
AAAAGATATA GCAGCCGAAT TACGGCAATA GTTACTGTTG CTGATGATGG AGGAAGTAGT
GGAGTTCTTA GGAGAGAACT GGGTGTTCAG CCTCCAGGAG ACATAAGGAA TTGTTTAGCA
GCTTTAGCAA CAGAAGAACC GCTTATTAAA GGACTTTTTC AATATCGCTT TCCCTCAGGT
AGTGGACTTG AGGGACATAG TTTTGGGAAT CTTTTCCTAT CTGCCTTGAC TGCGATAACT
GGAAGCTTAG AGACCGCAAT TACTGCTTCA AGTCGTGTTC TTGCAGTTCA GGGTCAAGTT
GTACCAGCAA CTAATGTAGA TGTTCGTTTG TGGGCGGAAC TCGAAAATGG GGACCGAATA
GATGGAGAGT CAGCCATAGG GAAAGCCCCC TTGCCAATAA TCAGAATCGG CTGTTACCCA
TCGCGACCCC CAGCTTTGCC AAGAGCTCTA GAAGCCATAA GAAATGCTGA AATTATTTTG
ATAGGTCCAG GAAGTTTATA TACTTCAATT CTTCCAAACC TGCTTGTCCC TGAAATAGTC
GAGGCGATTG AGAAGAGCAA AGCTCCTAAA CTATATGTAT GTAATTTGAT GACTCAACCT
GGAGAAACTG ATGGACTTGA CGTAACTGGT CATGTAAGGG CCATAGAAGC ACAATTGGCA
TCAAGAGGAA TTTCTAGGAA AATATTTAGT TCAATACTTG CTCAAGATGA ATTAAAACCA
TCCCCATTGG TTGATTACTA CAAATCAAAA GGGGCTGAAC CAGTCAAATG CAACAAGATT
GATCTTTTAT CTAGAGGATA TAATGTTTAT TTAGCTTCTC TTCAGGGTTC AAAGGTCACT
CCAACTTTGA GGCACGACCC AAGAAGTCTC GCTTTAGCAA TAATGAGGTT TTACAGAAGA
TACAAAAAAG GTAAATAA
 
Protein sequence
MNKRRKRKSR RATLRKSLLA SLSFHVYSRF KRAIRWLLPG LVVKRWMMTS GLGLLIALIG 
ASIWADLRPI YWVVEILFWF LGFITTFLPR TISGPLVFFI GISLLIWGQG RSFESIRQAL
GSKKDTFLVD ALRAKQKLNR GPNIVAIGGG TGLSSLLKGL KRYSSRITAI VTVADDGGSS
GVLRRELGVQ PPGDIRNCLA ALATEEPLIK GLFQYRFPSG SGLEGHSFGN LFLSALTAIT
GSLETAITAS SRVLAVQGQV VPATNVDVRL WAELENGDRI DGESAIGKAP LPIIRIGCYP
SRPPALPRAL EAIRNAEIIL IGPGSLYTSI LPNLLVPEIV EAIEKSKAPK LYVCNLMTQP
GETDGLDVTG HVRAIEAQLA SRGISRKIFS SILAQDELKP SPLVDYYKSK GAEPVKCNKI
DLLSRGYNVY LASLQGSKVT PTLRHDPRSL ALAIMRFYRR YKKGK