Gene P9303_17221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_17221 
Symbol 
ID4777029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1507190 
End bp1508434 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content40% 
IMG OID640087231 
Producthypothetical protein 
Protein accessionYP_001017731 
Protein GI124023424 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0786] Na+/glutamate symporter 
TIGRFAM ID[TIGR00210] sodium--glutamate symport carrier (gltS) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACAT TTGAAATTAG ATCGCTGATA TCAATCTCAA TTGGCATACT TGTACTGTTT 
ATTGGCAAGC GCATCAATGG CAATATTCAA GTGCTGCAGA ATTTATCGAT TCCTGATTCT
GTTACAGGCG GACTGATTGC AGCACTAGCA ATCACACTTG TGAAATGGAC AACGGATATA
GAGGTTATAT TTGATCTTGG TGCAAGAGAT TTCTTGCTTA TTTATTTCTT TACTACGGTT
GGTATAAGTT CAAATTTAAT CAACATGAGA AGGGGTGGGA AAGCCCTGGG AATATTACTC
GGTCTGGTTA TTGCTTTCAT TACGATTCAG AACATTTTCG GTGAATGGAC AGTTAATCTA
TTTGGACTCC AAGAAGGCCT TGGACCACTT TTGGGTTCAG TTTCGCTTGC AGGTGGCCAT
GGTGTGACCA TCGCATGGGC GCCAACCTTT GCAAAACAAT TTGGCATCAA GAATGCTTTG
GAAATTGGCG TTGCAGCATC TACATTTGGG TTGTTGGTTG CCAGCCTAAT AGGTGGGCCA
ATCGCCAATT ACTTGATTCG AAAAAATCAT CTGAATTTAG ATCATAAAAA AGAAATATAT
AATGAAAACA GGAATGGAGA AGAAAATACA GACAAGACGG AATTGAATAA TCAATTACCA
TCAAAATTGG ATTACAACAG TATTCTTTCG TCAGTGCTGG CTGTCAATAT CTGTATTATT
CTCGGGAAAA TATGCCAGGA ATTTCTTTTT AAGTTAGATA TTCAACTGCC ACTTTTTGTT
GTTTCTATGA TCGTTGGAAT CATTTTGAGC AGCATTTTGC CGGATCGTAA GATTTTAAAT
GATTTTCTAA TTTGGCCTAA GCAAACAGCT GCTTTATTAC TTCTGGCAGA GCTTTCCCTT
GGCACATTTC TGGCAATGTC AATCATGAGT CTGCAGTTTT GGGATCTCTT GGATCTCCCT
CCGGCCTTGC TGTTTATATT GGTTTTCCAA ACTATCCTCT CTATCGTAGT CAATCTATAT
CTTGTTTTTC CGCTGATGGG CCGCAACTAC GACGCAGCTG TGATTTGCTC AGGATTTGCG
GGCATGTCGA TAGGGTCATC GGCTGCTGGG CTCGCAAATA TGACGGCAAT ATCAAGGAAG
TATGGGCCAA CCAAAAAAGC ATTTATAGTC GTGCCACTGA TCGCAACATT TATTGAGATT
ATAAATTCAG GTCTTATTAT TCCAGCCTCA ATTAGGTTTT TATGA
 
Protein sequence
MDTFEIRSLI SISIGILVLF IGKRINGNIQ VLQNLSIPDS VTGGLIAALA ITLVKWTTDI 
EVIFDLGARD FLLIYFFTTV GISSNLINMR RGGKALGILL GLVIAFITIQ NIFGEWTVNL
FGLQEGLGPL LGSVSLAGGH GVTIAWAPTF AKQFGIKNAL EIGVAASTFG LLVASLIGGP
IANYLIRKNH LNLDHKKEIY NENRNGEENT DKTELNNQLP SKLDYNSILS SVLAVNICII
LGKICQEFLF KLDIQLPLFV VSMIVGIILS SILPDRKILN DFLIWPKQTA ALLLLAELSL
GTFLAMSIMS LQFWDLLDLP PALLFILVFQ TILSIVVNLY LVFPLMGRNY DAAVICSGFA
GMSIGSSAAG LANMTAISRK YGPTKKAFIV VPLIATFIEI INSGLIIPAS IRFL