Gene NATL1_20341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20341 
Symbol 
ID4779073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1677646 
End bp1678959 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content38% 
IMG OID640085327 
Producthypothetical protein 
Protein accessionYP_001015854 
Protein GI124026739 
COG category[S] Function unknown 
COG ID[COG4370] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03492] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.433263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAATCCT CTGCCTTACC ACTTGGCCAT GCCGCCGAAA ATGAAAGTTT CATTTCCAAT 
TCAAATCGTA TCAGTCAAGC CGAACAAAAT CCTTTGTTTC TTTGTAATGG ACATGGAGAA
GACATGATTG CATGCAGGGT TATAGAAGCT CTCCATGAAA TGAATCCAAA TATCTCCCAA
GAAGTTCTCC CAATGGTTGG AGATGGGAAA GCATTTTTAA AGCATGTAAA AAATGGTTGG
CTTGCAAAGA TTGGCACCTC AATATTTTTG CCAAGCGGAG GATTTAGTAA TCAGAGTTTT
AGTGGTTTGG TTTTAGATTT AAAAGCTGGA TTGTTAGGAA GTCTTTGGGT TCAATGGACT
TTGACTCAGA GAGCAGCCAA AGAAGGAAAA ATTATCGTTG CAGTTGGAGA TTTATTACCT
CTTCTTTTTG CATGGGCTAG TGGGGCAAAT TATTTTTTTA TTGGCACTCC TAAAAGTGAT
TACACATGGG CCAGTGGTCC AAGATCCGCT TTAAGTGATT GTTACCATCG ATTGAAAGGA
ACTGAGTGGG ATCCATGGGA ATATTGGTTA ATGCGATCTA GTCGATGCAA GATGGTTGCA
GTAAGAGACA AAATCACTGC TAGAGGTTTG AGAAATCACG GTGTAAAGGC ACTGTCCCTG
GGAAATCCAA TGATGGATGG AATTTCTAAA AGAGAATGTC CTAATGACTT TAAAAGATAT
AGGCGTTTGA TTTTGTTATG TGGAAGTCGT TTGCCTGAGG CGTATCAGAA TTTTAAAAAA
CTTTTAATTG CAATCCAGTT TATTCGAATT AAATCTTCCA TTGCAGTCTT TGTTCCTTTA
AGTTCTTCTG CAATGAGAAA AAAAATAGCA TTAATATTGA TGGACTTAGG CTTTAAATCT
ACTTATCAAT CAACAGGTCA AAATGGGATT TCAGAAATAT GGAAAAAAAA CTCATTACTT
ATATTGATTG GCTTTAACCA ATTTTCTTAT TGGGCTAAGT GGGGAGAAGT AGGAGTAGCT
AATGCAGGTA CAGCTACAGA ACAAGTAGTA GGTTTAGGAA TCCCATGCGT TTCCTTGCCA
GCAAAAGGAC ACCAATTTAA TTTCAATTTT GCTAAGCGTC AAAGTCGTTT ATTAGGAGGA
TCGGTGGCTA TTGCTAAGAG TTATGAAACT CTCGCAAAAC AAGTAGAGTT TTTACTGAAC
TCTGATTTTG ATAGAGAGAT TATTGGCTTA AGAGGGGCTC AAAGAATGGG TCCAGAGGGC
GGAAGTCACT CTATAGCACT TAGTATTTCG AATCACTTGT CCCAGGGCTC ATAG
 
Protein sequence
MQSSALPLGH AAENESFISN SNRISQAEQN PLFLCNGHGE DMIACRVIEA LHEMNPNISQ 
EVLPMVGDGK AFLKHVKNGW LAKIGTSIFL PSGGFSNQSF SGLVLDLKAG LLGSLWVQWT
LTQRAAKEGK IIVAVGDLLP LLFAWASGAN YFFIGTPKSD YTWASGPRSA LSDCYHRLKG
TEWDPWEYWL MRSSRCKMVA VRDKITARGL RNHGVKALSL GNPMMDGISK RECPNDFKRY
RRLILLCGSR LPEAYQNFKK LLIAIQFIRI KSSIAVFVPL SSSAMRKKIA LILMDLGFKS
TYQSTGQNGI SEIWKKNSLL ILIGFNQFSY WAKWGEVGVA NAGTATEQVV GLGIPCVSLP
AKGHQFNFNF AKRQSRLLGG SVAIAKSYET LAKQVEFLLN SDFDREIIGL RGAQRMGPEG
GSHSIALSIS NHLSQGS