Gene Synpcc7942_2238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2238 
Symbol 
ID3773894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2303629 
End bp2305044 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content47% 
IMG OID637800685 
Productglucose transport protein 
Protein accessionYP_401255 
Protein GI81301047 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.154375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCCT CCATTGCCAA AAGTGTTCAG AGCCATATTG ATGAAACGCC TCCCCGTCCT 
CTGAGCTCCA TGCAGTGGCA AATCTGGAGT TTGGCCGCGA TGGGGAAGCT ATTTGAAGGC
ATGGTGATTT TTATTACCGG TGTAGCTGTT CCATTGATTG AAAGAGACTT TAATCTCTCT
TCTGCACTCA AGGGCTCTGT TGCGGCGGCA TCTTTGTTGG GAATCTTGAT TGGAGCTTCC
CTGTTTGGCA ACTTGGCCGA TCGCTATGGG CGAAAGTTTG TCTTCGTCAT TGAGATGGCA
ATTTTTACAA TTGCGATCGC CCTATCCGCT GTAGCTTGGA ATGTCTCGGC TTTAATTTTC
TTTCTGTTTT GTAGTGGCTT GGCCCTAGGG GCCGATTATC CTATTGCCCA TATTATTGTG
TCTGAATCAA TTCCCAGCCG ATTTCGTGGC CGGATGGTAT TGGGAGCCTT TGCCTTTCAA
TCTGTGGGAT CTCTGGCTGG TGTATTGATT GGCCTACTGG TTTTGCGAAT TTATCCAGAA
GTAGGTGCTT GGCACTGGAT GTATGCAGCT CTTGTGATTC CAAGTATCCT TGTCTTCCTG
ATGAGAACTA AACTACCAGA AAGTCCTCAC TGGTTGGTCT CTCGCAAACA GTTTGAACTC
GCTCATCGGG CGGCTAAAGC ATTGCTGCAA CGGCCTGTAG CGATCGCTCA GAATGATTCC
GATGGGGAGC GGCAAAATCC CCGACTCGGC TACAGAAAAT TATTGACGCC TCGATACCTA
AGAGCGACAG TTCTTACTGC TGTTCCTTGG TTTCTTCAGG ATCTAGCAAC CTATGGAATT
GGAATTTTTA CACCTACGAT ATTGGCAACA TTATTCACAA AAGCTCAGCC TAACTTTGTC
TTTCAAGACA TGATTGCCAC AGAAGGTTCA GGCCTTATCG ATCTATTCTT GCTTGTGGGC
TTTGCTGCTT CTATCCCGTT GGTTGACAAA TTTGGGCGTA TCCCCCTGCA GATCATTGGC
TTTATTGGCT GTTCGGTTGG ACTCATCATT GCTTCTCTGT CGGCAGTGAG CATCCCAGAA
AGTCATGAGC TGAGGATTCT CTTCATTTTT GGTGGATTTA TTCTTTTCAA CTTCATGACT
AATTTGGGGC CGAACGCAAT GACCTACGTT TTGTCAGGGG AAGTATTCCC TACAGAAATA
CGAGGTGTCG GAGCGGGCTT TGCTGCTTCT TTTGCCAAGA TTGGTGCGGT TGCCACAGCC
TTCTTCTTCC CAATCCTGCG TGAGCAAATT GGAACAGTTG CTCTGCTGTG TGGTTTGGCT
GTCACATCGC TCTTGGGTGC GCTGGTAACA TTTCTCTTTC GAATTGAGCC CAATGGCCAC
AGTCTTGAAG AATTAAGCGA GAAGCCTCTT GTTTAG
 
Protein sequence
MSSSIAKSVQ SHIDETPPRP LSSMQWQIWS LAAMGKLFEG MVIFITGVAV PLIERDFNLS 
SALKGSVAAA SLLGILIGAS LFGNLADRYG RKFVFVIEMA IFTIAIALSA VAWNVSALIF
FLFCSGLALG ADYPIAHIIV SESIPSRFRG RMVLGAFAFQ SVGSLAGVLI GLLVLRIYPE
VGAWHWMYAA LVIPSILVFL MRTKLPESPH WLVSRKQFEL AHRAAKALLQ RPVAIAQNDS
DGERQNPRLG YRKLLTPRYL RATVLTAVPW FLQDLATYGI GIFTPTILAT LFTKAQPNFV
FQDMIATEGS GLIDLFLLVG FAASIPLVDK FGRIPLQIIG FIGCSVGLII ASLSAVSIPE
SHELRILFIF GGFILFNFMT NLGPNAMTYV LSGEVFPTEI RGVGAGFAAS FAKIGAVATA
FFFPILREQI GTVALLCGLA VTSLLGALVT FLFRIEPNGH SLEELSEKPL V