Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_17221 |
Symbol | |
ID | 4777029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1507190 |
End bp | 1508434 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640087231 |
Product | hypothetical protein |
Protein accession | YP_001017731 |
Protein GI | 124023424 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0786] Na+/glutamate symporter |
TIGRFAM ID | [TIGR00210] sodium--glutamate symport carrier (gltS) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACAT TTGAAATTAG ATCGCTGATA TCAATCTCAA TTGGCATACT TGTACTGTTT ATTGGCAAGC GCATCAATGG CAATATTCAA GTGCTGCAGA ATTTATCGAT TCCTGATTCT GTTACAGGCG GACTGATTGC AGCACTAGCA ATCACACTTG TGAAATGGAC AACGGATATA GAGGTTATAT TTGATCTTGG TGCAAGAGAT TTCTTGCTTA TTTATTTCTT TACTACGGTT GGTATAAGTT CAAATTTAAT CAACATGAGA AGGGGTGGGA AAGCCCTGGG AATATTACTC GGTCTGGTTA TTGCTTTCAT TACGATTCAG AACATTTTCG GTGAATGGAC AGTTAATCTA TTTGGACTCC AAGAAGGCCT TGGACCACTT TTGGGTTCAG TTTCGCTTGC AGGTGGCCAT GGTGTGACCA TCGCATGGGC GCCAACCTTT GCAAAACAAT TTGGCATCAA GAATGCTTTG GAAATTGGCG TTGCAGCATC TACATTTGGG TTGTTGGTTG CCAGCCTAAT AGGTGGGCCA ATCGCCAATT ACTTGATTCG AAAAAATCAT CTGAATTTAG ATCATAAAAA AGAAATATAT AATGAAAACA GGAATGGAGA AGAAAATACA GACAAGACGG AATTGAATAA TCAATTACCA TCAAAATTGG ATTACAACAG TATTCTTTCG TCAGTGCTGG CTGTCAATAT CTGTATTATT CTCGGGAAAA TATGCCAGGA ATTTCTTTTT AAGTTAGATA TTCAACTGCC ACTTTTTGTT GTTTCTATGA TCGTTGGAAT CATTTTGAGC AGCATTTTGC CGGATCGTAA GATTTTAAAT GATTTTCTAA TTTGGCCTAA GCAAACAGCT GCTTTATTAC TTCTGGCAGA GCTTTCCCTT GGCACATTTC TGGCAATGTC AATCATGAGT CTGCAGTTTT GGGATCTCTT GGATCTCCCT CCGGCCTTGC TGTTTATATT GGTTTTCCAA ACTATCCTCT CTATCGTAGT CAATCTATAT CTTGTTTTTC CGCTGATGGG CCGCAACTAC GACGCAGCTG TGATTTGCTC AGGATTTGCG GGCATGTCGA TAGGGTCATC GGCTGCTGGG CTCGCAAATA TGACGGCAAT ATCAAGGAAG TATGGGCCAA CCAAAAAAGC ATTTATAGTC GTGCCACTGA TCGCAACATT TATTGAGATT ATAAATTCAG GTCTTATTAT TCCAGCCTCA ATTAGGTTTT TATGA
|
Protein sequence | MDTFEIRSLI SISIGILVLF IGKRINGNIQ VLQNLSIPDS VTGGLIAALA ITLVKWTTDI EVIFDLGARD FLLIYFFTTV GISSNLINMR RGGKALGILL GLVIAFITIQ NIFGEWTVNL FGLQEGLGPL LGSVSLAGGH GVTIAWAPTF AKQFGIKNAL EIGVAASTFG LLVASLIGGP IANYLIRKNH LNLDHKKEIY NENRNGEENT DKTELNNQLP SKLDYNSILS SVLAVNICII LGKICQEFLF KLDIQLPLFV VSMIVGIILS SILPDRKILN DFLIWPKQTA ALLLLAELSL GTFLAMSIMS LQFWDLLDLP PALLFILVFQ TILSIVVNLY LVFPLMGRNY DAAVICSGFA GMSIGSSAAG LANMTAISRK YGPTKKAFIV VPLIATFIEI INSGLIIPAS IRFL
|
| |