Gene P9303_04421 details


Gene Information

Locus tag: P9303_04421
Symbol: putP
ID: 4776985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 447714
End bp: 449483
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 57%
IMG OID: 640085946
Product: sodium/solute symporter family protein glucose transporter
Protein accession: YP_001016459
Protein GI: 124022152
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
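
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(447714, 449483)
print(gene_length)                           # 1770
print(protein_length_from_cds(gene_length))  # 589
```

Both results agree with the Gene Length and Protein Length fields of the record.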


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCCA TTGATTGGAT AATCCTCGTT GGTTATCTCG CCATCACTCT CGTGATTGGG 
CTTTGGTTGT CTCGTCGCAA TCGAAGTGAG GATGATTACT TTGTTGCGGG ACGTCGACTC
AGTGGTTGGT TGGCTGGGGC TTCGATGGCC GCAACGACAT TTTCGATTGA TACCCCTCTT
TATGTAGCCG GTCTGGTTGG CACACGAGGA TTGGCCGGCA ATTGGGAATG GTGGAGCTTC
GGCCTGGCGC ATGTTGCCAT GGCAGTTGTC TTTGCTCCGC TCTGGCGACG GAGTGGGGTG
CTGACCGATG CTGCATTCAC CGAGCTGCGT TATGGCGGAG CGCCTGCCGC CTGGCTGCGT
GGGATTAAAG CATTTTTGCT CGCACTGCCG GTGAATTGCA TCGGTATCGG CTATGCCTTC
CTGGCCATGC GCAAGGTCGT AGAAGCCCTG GGCATAGTGT CTGGCAGCCC TGTGGCAGTG
CTTGGCGGTA CCCCTGACAC TGTTGTGCTG TTGGCAGTCG TGGCTGTGAT GGTGCTTGTT
TACACAGTTG CTGGCGGCCT CTGGGCTGTC GTGATCACCG ATTTCGTGCA GTTGCTGTTG
GCTCTTTTGG GAGCGGTGGC GGTGGCGTGG GCCGCCGTTC ATGCAGCCGG AGGGATGGAC
GCCATGCTCG ATCAACTGAG GGGGTTAGAG CGACCTGAAC TGCTGGCGAT CGTGCCCTGG
CAATGGAATG GTGATGGCTT TGATTGGATT GGTGGCGCCG AGATCAGTGT GGCCACATTC
CTTTCTTATC TCACTGTCCA ATGGTGGAGT TTCCGTCGCA GCGACGGTGG TGGTGAGTTC
ATTCAGAGAA TGTTGGCCAC GCGGGACGAG CGTCAGGCTC AACTGGCTGG CTGGGTGTTT
CTGGTGGTCA ACTATCTGGT GCGCAGTTGG TTGTGGGTTG TGGTTGCTCT TGCCGCTTTA
GTGCTGTTGC CAGCTCAGGC CGACTGGGAA CTCAGTTACC CAACCTTGGC TGTGACCTAC
CTGCCACCGG TGGTGCTTGG TCTTGTCGTG GTCTCTCTGG TGGCGGCTTT CATGAGTACG
GTCAGTACCG CAGTGAACTG GGGTGCTAGC TATCTCACCC ACGATCTTTA CCAGCGTTTT
ATAAGGCCGG ATGCTGATCA GAGAGAGCTG TTGCTGGTCG GTCAATTCAC CAGTGTTTTG
CTGCTAGTTC TCGGGGTGAT CACCGCATTA ATTAGCGACA GCATCGGCAC AGTGTTTCGT
CTGGTGATTG CTATTGGTAC CGGGCCCGGA GTGGTGCTTG TGCTGCGTTG GTTCTGGTGG
CGGATCAATG CTGCTGCCGA ATTGGCGGCG ATGGTTTGTG GTTTTGTGGT GGGCCTTTGT
ACATCCGTTG TGCCCTTGGT GACAATTCCT GATTACGGCA CCAGGTTGAT GATCACCACA
GCGATTACTG CTGTGGTCTG GGTTGTAGTG ATGCTGGTGA CTCCGCCTGA ATCGCCGCAA
GTGCTCGAGC GCTTTGTGAA CCAGGTGCAG CCACCAGGCC CAGGCTGGAG TCGCTTGCGT
CAACGCCTGG ATATAACACC GGTGGATTCC CTGGTGAGTT TGTGTTTGCG CTTCATTCTT
AGTGTCGGTG TTCTGTTTGG CGGTCTGCTC GGCACTGGCG CGTTTCTGTT ACATCAGCAG
CTAGGCGGTT GGATCGGTTT GGTGGTCACC GTGGTCTGTG TGCTGCCTTT AACCCGTGGT
TGGCTGCGTA GTCGGATGGG CGTCGCATGA
 
Protein sequence
MAAIDWIILV GYLAITLVIG LWLSRRNRSE DDYFVAGRRL SGWLAGASMA ATTFSIDTPL 
YVAGLVGTRG LAGNWEWWSF GLAHVAMAVV FAPLWRRSGV LTDAAFTELR YGGAPAAWLR
GIKAFLLALP VNCIGIGYAF LAMRKVVEAL GIVSGSPVAV LGGTPDTVVL LAVVAVMVLV
YTVAGGLWAV VITDFVQLLL ALLGAVAVAW AAVHAAGGMD AMLDQLRGLE RPELLAIVPW
QWNGDGFDWI GGAEISVATF LSYLTVQWWS FRRSDGGGEF IQRMLATRDE RQAQLAGWVF
LVVNYLVRSW LWVVVALAAL VLLPAQADWE LSYPTLAVTY LPPVVLGLVV VSLVAAFMST
VSTAVNWGAS YLTHDLYQRF IRPDADQREL LLVGQFTSVL LLVLGVITAL ISDSIGTVFR
LVIAIGTGPG VVLVLRWFWW RINAAAELAA MVCGFVVGLC TSVVPLVTIP DYGTRLMITT
AITAVVWVVV MLVTPPESPQ VLERFVNQVQ PPGPGWSRLR QRLDITPVDS LVSLCLRFIL
SVGVLFGGLL GTGAFLLHQQ LGGWIGLVVT VVCVLPLTRG WLRSRMGVA
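
The sequences above are printed in space-separated groups of ten characters. A minimal sketch for working with them follows: it joins the groups, computes GC content, and translates the gene sequence so the result can be compared with the protein sequence. All function names are hypothetical. The codon table is the standard one, whose amino acid assignments are the same in translation table 11; the sketch assumes the sequence starts with ATG (as this gene does) and does not handle the alternative start codons that table 11 permits.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
first_bases = clean_sequence("ATGGCTGCCA TTGATTGGAT AATCCTCGTT")
print(translate(first_bases))  # MAAIDWIILV
```

Applied to the full gene sequence, gc_percent gives a value that can be compared with the GC content field of the record, and translate gives a string that can be compared with the protein sequence after passing it through clean_sequence.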