Gene NATL1_02361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02361 
Symbol 
ID4779559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp218139 
End bp219725 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content27% 
IMG OID640083501 
Producthypothetical protein 
Protein accessionYP_001014065 
Protein GI124024949 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAC TTCAATCTTA TTTTTTAATA TCTGCGATAG TAATTTTATC AATTTTGACT 
GGGATATTTA TCTGGCGCAA TAAGCATCTT ACGCAAATCC CTAAGTTCAA TGAAGAATCA
TTCAATGCTC CGGTTTCCTC AAAATATATA CCAAAGAATA CTGATCTCGT ATTCCACTGG
AAACTGAATC CAGGCTTACT TCCAAAGTAC ATCGAAAATT ATCAAGATAA AGTTAGTAAA
CACGCCATAA ACAAAAAAGT AAGTTTTATT AGAGATTCCT CTTTTCAATT AATTGGCTTT
AATTTTGCAA AAGACATCTC AAAATGGGTA GGAGATTATG GGAGCTTTGC AGTATTTGAT
TCAAACAAAA AAACTATAAA TGATTGGTTG ATGGTCTTAG CAATAAAAGA AGATGTAAAT
ATTAAACAAG AATTAGAATC TATTTTAGGA TCAAAAGTTG TTGATGAGAG TACTACTCAA
AGCAATAAAA TCAGCACCTC AAAAACAGAA ATAATTTCAA AACAAATTAA TTCAAATAAC
TCAATCTACT TTGCAAATGA TGAAGATAAT CTTTTAATAT CATCCAATCC TAATATCATA
CAATCTTCAA TAGAAGAATT AGATAGCAAT ATAATAAATA CAAAAAAAAT GTATAAGAAT
ATTCAATTAA AGGATAATCT TAAAGACGGA TTATTATTAT TAGAAATGTC TCCAAAAAAG
ATTTTAAATC TTATTGGTCA AGAAGAAGAT TTATTGAATA TAAATAAGGT AGATAATTTA
CTATCTTCTG TAAATGTAGA TAAAAATAAA TTAAACTTAG AAGGAATATT AGCTTACAAT
GTTAAAACTA AAATGCCAGT TAAAGATATT AATTCTAATT TAATTGATAT AAAAAAGGAA
TCTGAATTGC CTGAGAATTA TATATTAGTT GACAATCCCA AGCAGTATTT CCAGAAAGAT
TCTGTCCATC CATATCAAAA GCTAATAGCC TCTATTATCA AAGAATCAAC AACCTCAGAT
TATTCTGAGC TCTTAAAAAT AATTCTTGAA AACTCTCAAG GAAATTTGAT TTGGATAAAT
GATAAAGACT GGTTGATTTT AACTAGGAAA TCTGATACGA AAAAGACAGA GATAGATGAT
ATTCTAAAAA AAGAGAATTT TTTGAATTCA AATCTAGATT TTAAAAGCAG AAAGCTAGAG
ATTTGGTCAA AAATAAGTAC AAATGAAAAT AATACATATG AGCTAAAAGA TAACATTGAG
GCAATTGTCG AAGAAGATGA CAAGACTTAC ATTTGGAGTC AAAACTTATC TTCTATATCA
AATTTTGATA ATACAAACTA CCTAAAAAAT TATTCAGATA ATGAACAGAA TACAAATGAA
TTTAATGATT TTGATGATAT CTTGAAAATT CATTTAGGGA AAGAAAAAAC TAAAGCAATT
TTAAATAGTT TCTATCCATA TATCTTATTG AAAACTATGT TAGGAAACAC ACTAAATCCT
CCTCAGGATA TTGATATAGC CATTGCAGTC CCTACAATTA ATTATCCAGA CTTCATTAAA
GTTAAAATCA ACTTAAAAAC AAGTTGA
 
Protein sequence
MKSLQSYFLI SAIVILSILT GIFIWRNKHL TQIPKFNEES FNAPVSSKYI PKNTDLVFHW 
KLNPGLLPKY IENYQDKVSK HAINKKVSFI RDSSFQLIGF NFAKDISKWV GDYGSFAVFD
SNKKTINDWL MVLAIKEDVN IKQELESILG SKVVDESTTQ SNKISTSKTE IISKQINSNN
SIYFANDEDN LLISSNPNII QSSIEELDSN IINTKKMYKN IQLKDNLKDG LLLLEMSPKK
ILNLIGQEED LLNINKVDNL LSSVNVDKNK LNLEGILAYN VKTKMPVKDI NSNLIDIKKE
SELPENYILV DNPKQYFQKD SVHPYQKLIA SIIKESTTSD YSELLKIILE NSQGNLIWIN
DKDWLILTRK SDTKKTEIDD ILKKENFLNS NLDFKSRKLE IWSKISTNEN NTYELKDNIE
AIVEEDDKTY IWSQNLSSIS NFDNTNYLKN YSDNEQNTNE FNDFDDILKI HLGKEKTKAI
LNSFYPYILL KTMLGNTLNP PQDIDIAIAV PTINYPDFIK VKINLKTS