Gene P9211_16021 details


Gene Information

Locus tag: P9211_16021
Symbol: putP
ID: 5730898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1427297
End bp: 1429060
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 43%
IMG OID: 641285980
Product: Na+/proline symporter
Protein accession: YP_001551487
Protein GI: 159904143
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
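
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon. The record does not state either convention, although both are consistent with the reported lengths. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1427297, 1429060)
print(length, expected_protein_length(length))  # prints: 1764 587
```

Both results match the Gene Length (1764 bp) and Protein Length (587 aa) fields of this record.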


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTAA TTGATTGGTT AGTTTTACTT GGGTATTTGA CAGCTACTTT ATTTTTGGGA 
ATAGCTTTGT CGCGGAGAAA TCGTTCTGAT AGTGATTACT TTGTTGCAGG GCGTCGGCTG
ACCGGCTGGC TTGCTGGGGC TTCAATGGCA GCTACAACAT TTTCAATTGA TACACCTCTC
TATGTCGCAG GAGTTGTAGG AACTCGTGGC CTTCCTGGAA ATTGGGAATG GTGGAGTTTT
GGATTGGCGC ATGTAGCAAT GACAGTTGTC TTTGCTCCAA TGTGGCGTCG AAGCGGGGTG
ATTACTGATG CGGCTTTTAC TGAGTTACGT TATGGAGGTT TGCCTGCAGC TTGGTTGCGA
GCAATTAAGG CATTTTTGTT GGCTATTCCT ATTAACTGTA TAGGCATCGG CTATGCATTC
TTGGCCATGC GTAAGGTAGC AGAGGCATTG GGAATAGTGG ATGGGCACAC CATTTTTGGA
CCTTTTAGTG ATACTTTATT GCTCCTTATA GTGGTTGCTT TCCTTTTACT TGTTTATACA
GTTGTTGGCG GATTATGGGC GGTTGTAGTT AATGATTTAA TTCAATTAAT ACTGGCTCTC
CTAGGCGCCT TTGCCGTGGC TTTTGCTGTT ATCCATGCTT CAGGCGGGAT GAATCAAATG
TTATTGAGGT TGCAAGATTT AGATCGACCT GAGCTGCTTT CTATTTTCCC TTGGACATGG
ACAGACAATG GGTTGGAATG GATTGAGAGT GCAGGAATAA GTGTTGCAAC TTTTACGGCC
TTTCTTTCAC TGCAATGGTG GAGTTTTCGA CGTAGTGATG GTGGTGGCGA GTTTATACAG
AGATTACTGG CGACTCAAAA TGAGAAGCAA GCAAAGAGGG CTGGTTGGGT TTTTCTGATT
GTGAATTATC TTGTTAGAAG TTGGCTTTGG ATTGTCGTCG GATTGGGAGC ACTTGTTTTG
TTGCCTGCTC AGCAGGATTG GGAAATGAGT TATCCCACTC TCGCGGTACG ATATCTGCCA
CCTGTGGTTC TAGGGTTAGT AGTTGTCTCT TTAGTAGCAG CCTTTATGAG TACTGTTAGC
ACCTCTATTA ATTGGGGTGC AAGCTATTTG ACTCATGACC TTTATCAAAG ATTTTTTAGG
CCTACGGCTA GTCAAAAAGA ATTGTTGTTG ATAGGCCAAA TAACAAGCTC TATTTTGCTT
CTGTTGGGAA TTTGCACAGC TTTGATCAGC GATAGTATTG GTGCAATCTT TCGTTTGGTT
ATTGCTATTG GTACTGGTCC AGGAGTAGTT CTTGTTTTGC GATGGTTTTG GTGGCGCATA
AATGCAATAG CAGAGTTGGC AGCAATGCTT TCTGGCTTTT TTGTAGGACT TGTGACATCT
ATAGTCCCTG TTCTGAGGAT CGATGACTAT GGAATTAAGT TGATGGTCAC AACTGGCCTT
ACTGCAATTA CTTGGTTAAT TGCTATGTTC ACAACTCCAC CTGAATCAGA AGAGGTTTTA
GAAAAATTTG TAGTTTTAGT AAAGCCTCCT GGTCCTGGAT GGGAGGTTTT AAGAAATAAG
TTTCGAGTGC AGGCTGTTGA TCCTTTGCAA GATTTGTTGA TTCGTTTTTC CTTAAGCATT
GGAGTACTTT TTGGCGGCCT TTTCTCAACA GGATCATTCT TATTGCATCA GGAGAGAGGC
GGGTGGATAG GATTGGTTAT TTGTTCTTTC TGCCTGGTAG GAATCAATGG GAAGTTTGTA
CGAAGAAGTT TTTCTGAGGG TTGA
 
Protein sequence
MTLIDWLVLL GYLTATLFLG IALSRRNRSD SDYFVAGRRL TGWLAGASMA ATTFSIDTPL 
YVAGVVGTRG LPGNWEWWSF GLAHVAMTVV FAPMWRRSGV ITDAAFTELR YGGLPAAWLR
AIKAFLLAIP INCIGIGYAF LAMRKVAEAL GIVDGHTIFG PFSDTLLLLI VVAFLLLVYT
VVGGLWAVVV NDLIQLILAL LGAFAVAFAV IHASGGMNQM LLRLQDLDRP ELLSIFPWTW
TDNGLEWIES AGISVATFTA FLSLQWWSFR RSDGGGEFIQ RLLATQNEKQ AKRAGWVFLI
VNYLVRSWLW IVVGLGALVL LPAQQDWEMS YPTLAVRYLP PVVLGLVVVS LVAAFMSTVS
TSINWGASYL THDLYQRFFR PTASQKELLL IGQITSSILL LLGICTALIS DSIGAIFRLV
IAIGTGPGVV LVLRWFWWRI NAIAELAAML SGFFVGLVTS IVPVLRIDDY GIKLMVTTGL
TAITWLIAMF TTPPESEEVL EKFVVLVKPP GPGWEVLRNK FRVQAVDPLQ DLLIRFSLSI
GVLFGGLFST GSFLLHQERG GWIGLVICSF CLVGINGKFV RRSFSEG
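
The sequences above are displayed in space-separated groups of ten residues, so they need to be cleaned before any computation. The following is a minimal sketch of how that might be done, along with a GC-content calculation and a basic sanity check for a complete coding sequence. The helper names are hypothetical. The stop codon set is the one used by translation table 11. The sample uses only the first and last display lines of the gene sequence for brevity, so the full sequence would have to be pasted in to reproduce the GC content field.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Drop the spaces and line breaks used for display; upper-case the residues."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First and last display lines of the gene sequence, used as a short sample.
sample = clean_sequence("""
ATGACATTAA TTGATTGGTT AGTTTTACTT GGGTATTTGA CAGCTACTTT ATTTTTGGGA
CGAAGAAGTT TTTCTGAGGG TTGA
""")
print(len(sample), looks_like_complete_cds(sample), round(gc_percent(sample), 1))
```

Applied to the full 1764 bp gene sequence, `gc_percent` would give the value to compare against the reported GC content of 43%.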