Gene NATL1_17331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17331 
Symbol 
ID4779097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1418931 
End bp1420040 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content36% 
IMG OID640085020 
Productintegral membrane protein 
Protein accessionYP_001015553 
Protein GI124026438 
COG category[R] General function prediction only 
COG ID[COG4956] Integral membrane protein (PIN domain superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.07903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACC TAGTTATTCT GGTTTTATTT CTTATATCAG GTGCAATCAC CGGTTGGATT 
GGAGTGAATT GGTTACCTGA AGAGACTTTA GATCATATAA CTAATCTAAG AAATTTAAAA
ATTGCTTTAT CAGGATTAAC CGCTCTTATA GGCCTGTTAA TAGCCTTCCT TTTTCAGCAA
TTTAGAAATA AATTAACTAA AAGAATAAGA ACTATGCCAA CAGATCTACT AGTTAGTAGA
TCTGTTGGCA TAGTTCTCGG ACTTGTCATA GCAACACTTT TATTAGTTCC TGTACTACTT
TTACCTTTAC CCTCTGAATT GTTTTTTGTA AAACCTATTT TTGCTGTACT TAGCAACATT
TTTTTTGGAG TACTTGGATA TAACCTTGCG GATGTTCATG GACGAACCGT TTTACGTCTA
TTCAACCCAA ATAGCACTGA ATCTTTGTTA GTTGCAGATG GGGTTTTAAC TCCAGCTAGT
GCAAAAGTTT TAGATACCAG TGTAATAATT GATGGTCGCA TTCAGGCCCT CCTTAGATTT
GGGCTTATAG AAGGGCAAAT AATCGTAGCT CAATCAGTTA TGGATGAGCT TCAAAAACTA
TCTGACTCAA GTAACAACGA AAAAAGAGGA AAAGGAAGAA GAGGACTAAA ATTACTTAAC
CAATTAAGAG AAAGTTATGG AAGAAGATTA GTTATAAACA GCACTAGATA TGAAGGGGAA
GGAACTGATG AAATTCTTTT AAAATTAACC TCAGACATCT CAGGAATATT AATCACTGTC
GATTACAACT TGTCTCAAGT AGCTCTAGTC CAAGAAATAA AAGTTCTTAA CTTAAGTGAT
CTAGTACTTG CAGTTAGGCC TGAAGTTCAA CCTGGTGAGA AGTTAAATTT AAAAGTAGTT
AGAGAAGGTA AGGAAAACTC ACAAGGCATT GCTTATCTTG AAGATGGCAC AATGGTTGTT
ATTGAGGAGG GTCTCCAATG GATCGGTAAA AGGATAGAAG TAGTTGTGAC TGGAGCATTA
CAAACACCGA CGGGCCGAAT GGTTTTTAGT AAAGCTTCAA ATGATCAGCC CCCGAACAAA
TTTGAAAAAA CAAAAGCATC TCAAGGCTAG
 
Protein sequence
MADLVILVLF LISGAITGWI GVNWLPEETL DHITNLRNLK IALSGLTALI GLLIAFLFQQ 
FRNKLTKRIR TMPTDLLVSR SVGIVLGLVI ATLLLVPVLL LPLPSELFFV KPIFAVLSNI
FFGVLGYNLA DVHGRTVLRL FNPNSTESLL VADGVLTPAS AKVLDTSVII DGRIQALLRF
GLIEGQIIVA QSVMDELQKL SDSSNNEKRG KGRRGLKLLN QLRESYGRRL VINSTRYEGE
GTDEILLKLT SDISGILITV DYNLSQVALV QEIKVLNLSD LVLAVRPEVQ PGEKLNLKVV
REGKENSQGI AYLEDGTMVV IEEGLQWIGK RIEVVVTGAL QTPTGRMVFS KASNDQPPNK
FEKTKASQG