Gene NATL1_17501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17501 
SymbolmelB 
ID4780241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1432044 
End bp1433447 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content37% 
IMG OID640085038 
ProductGPH family sugar transporter 
Protein accessionYP_001015570 
Protein GI124026455 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.276796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATAATT CCACATCGAT TTACGAAGGA AATAATCGCA CAAGACTAAT GATCTCTTAT 
GCAATGGGAG ATGCTGGAAC GGGATTAGCC GCAATACAGC TAGGTACTTA TCTATTTCTT
TTTTTTACTT GTGCTGCAGG AATTCCTGCA TTTATTGCAG GCTCGCTTCT CATGGTTTCA
AAACTTTGGG ATGCGATAAA TGATCCTCTA ATTGGATGGA TGAGTGATCG CACCAGATCA
AGATGGGGGC CAAGGCTTCC ATGGATGATT GGAGGTGCTG TTCCACTTGG TTTATTCCTT
GCCGCAATGT GGTGGGTCCC TCCTGGAGAT ATAGATGCGA AAACAACTTA TTACGTATTC
GCAGCAATTT TTTTGATGAC AGCTTATACA GCAGTAAATC TACCTTTTGC AGCATTATCT
ACTGAGCTAA CTGAAAATAT AGCTATTAGA ACAAGACTTA ATGCTGCAAG ATTTACTGGG
TCTATTATAG CTGGAACCAC TGGATTAATA GTGGCTGCAG GCTTCTTATC TCAAGGAGTA
GAAGGTTATA CTTCAATGGG AAGAGTAACA GGAGTTATTG CTACTTTTAC CACATTAATT
GCTTGCTGGG GACTAGCCCC ATTTGCTAAA AAAGCCAGAA AGCCCACTTC TCAATCAGAA
CCTTTTAATC AGCAGCTAAA AAGAGTTTTA AATAATAAAC TTTTCACACG AATTATTGCT
CTTTACTTGC TGCTTTGGTG CGGACTGCAA TTAATGCAAA CCGTTTCATT AATCTATCTT
GAGCAAGTAA TGCTTGTTCC AATAGAAATT TCAAAATGGA TCCCTATACC ATTTCAAATT
AGTACTCTAT TAGGTCTACA GTTTTGGAGC TTTTACTCCA ATAAATATGG AAGAATATCA
GCACTATTCA AAGGTGGGAA AATATGGATA TTAGCCTGTT TTTTAGTTAT ATTTATGCCC
CCAATAACTA AAGGAGTCAG TATCAATTCT TTATTAGCCT TTGGTGATAT TGAAGGTATA
AAGCTGTTGA TTCTTTTATT AATAATTATT TTGGTAGGAT TTGGAGCTTC AACAGCATAT
CTTATTCCTT GGTCCTTACT TCCTGATGCT ATTGATCAAG ATCCCGAAAA GCCTTCAGGA
ATATATACAG CATGGATGGT TTTTATTCAG AAAATAGGTA TTGGTTTAAG CGTTCAATTT
CTAGGAGTTC TTTTATCTTT ATCAGGATAT AAATCATCCA CTAATTGCTT ATCAAGTCTT
GAAGACCTAG ATCAACCTCT AACAGCAATT ATTACTATTA GATTATGCAT GGGATTAATA
CCTTCTTTGC TAGTAATTGC TGGATTAATA ACTATGAAAC CGTGGCGAAG TTTAGATTTC
AAATCTAGAA GGTTAAGTCA ATGA
 
Protein sequence
MNNSTSIYEG NNRTRLMISY AMGDAGTGLA AIQLGTYLFL FFTCAAGIPA FIAGSLLMVS 
KLWDAINDPL IGWMSDRTRS RWGPRLPWMI GGAVPLGLFL AAMWWVPPGD IDAKTTYYVF
AAIFLMTAYT AVNLPFAALS TELTENIAIR TRLNAARFTG SIIAGTTGLI VAAGFLSQGV
EGYTSMGRVT GVIATFTTLI ACWGLAPFAK KARKPTSQSE PFNQQLKRVL NNKLFTRIIA
LYLLLWCGLQ LMQTVSLIYL EQVMLVPIEI SKWIPIPFQI STLLGLQFWS FYSNKYGRIS
ALFKGGKIWI LACFLVIFMP PITKGVSINS LLAFGDIEGI KLLILLLIII LVGFGASTAY
LIPWSLLPDA IDQDPEKPSG IYTAWMVFIQ KIGIGLSVQF LGVLLSLSGY KSSTNCLSSL
EDLDQPLTAI ITIRLCMGLI PSLLVIAGLI TMKPWRSLDF KSRRLSQ