Gene Syncc9902_2291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_2291 
Symbol 
ID3743445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp2197492 
End bp2198751 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID637772493 
Productmajor facilitator superfamily permease 
Protein accessionYP_378292 
Protein GI78185858 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTCTCC TCCGACCACT GGTCGACTTG CAGCGTCCTC GCATACCAAC TCTGCTGAGC 
GCATTTTTAA CGCTGCTCAA CGATCGGCTG AGCGAAAGCA TCGTCTTTCC TCTGCTTCCT
TTTCTACTGG CGCAGTTCGC GCCGAATGGG CGAACGCTGG GGCTCTTAGC GGGCAGCTAC
GCATTATCCC AATTTCTCGT CACTCCACTC ATCGGTGCCT TGAGTGATCG CTATGGCCGA
CGACCGGTGA TTGCCATTTG CGTGGGGGGA TCCGTCGTTG GACTTGGACT CTTTGCACTA
ACCCTGAGCC TGCCCTGGCC AGAAGCGAGT TTGTGGCCCT TGATCTTGTT ATTCAGCGCA
CGCGTCATCG ACGGGATCAG CGGAGGGACA GCAGCAACCG CCTCAGCGGT TCTGGCCGAT
ATCAGCTCCC CGGAACAACG GGCCCGCACC TTTGGCCTCA TCGGAGTGGC CTTCGGCCTC
GGCTTCATTT TGGGACCCTT CCTCGGAGGA CAACTGGCAC AAATCGCCGT TCCACTCCCC
CTGTGGGTTG CCACCGGCTT CGCTTGTCTC AACTTGGGTG TCGTTCTCAC GTTGCTCCCC
GAAACCCACC CGGTTGAGCA AAGACAAAAT CTGCCAAAAC GCCGAGACCT CAATCCATTT
CGTCGCATTG GTCAGGTTCT GATCAACCCG AGTGTTGGAC GACTCTGTGG CGCATTTTTT
CTGTTCTTCC TTGCCTTCAA CGGCTTCACA GCGATCCTGG TGCTGTACTT CAAGCAACGC
TTCGATTGGG GGCCAGAGCT CGCCACAACA GCCTTTCTCG TTGTTGGTGT CGTAGCGACA
GTCGTCCAGG GAGGCTTGAT CGGTCCATTG GTCCAACGCT TTGGTGAATG GAAACTCACC
CTGTTCGGAC TTGGCCTCGT GATCGCGGGT TGCCTGTTAA TTCCAGCCGT CGGAGCAGCC
GATCGAGCGC CCGCCATTTT CTGCTCAGTC GGGATCCTGG CCCTTGGAAC CGGCCTCGTC
ACCCCAAGCT TACGAAGCCT GGTTTCACGA CGACTCAGCA GTGAAGGACA AGGAACGGCA
CTCGGCAGCT TGCAAGCACT GCAAAGTTTG GGAAGTTTTT TAGGCCCACC GATTGCAGGA
ATCAGCTACG ACCTTCTTGG CCCGACCAGC CCTTTCGTAC TCGCCGCATC ACTCCTCGTC
ATCGTCATCG CACTAGTGGC CAGAAGTCCA CTCACCAAAA ACCTTCAACT GAGCACTTGA
 
Protein sequence
MSLLRPLVDL QRPRIPTLLS AFLTLLNDRL SESIVFPLLP FLLAQFAPNG RTLGLLAGSY 
ALSQFLVTPL IGALSDRYGR RPVIAICVGG SVVGLGLFAL TLSLPWPEAS LWPLILLFSA
RVIDGISGGT AATASAVLAD ISSPEQRART FGLIGVAFGL GFILGPFLGG QLAQIAVPLP
LWVATGFACL NLGVVLTLLP ETHPVEQRQN LPKRRDLNPF RRIGQVLINP SVGRLCGAFF
LFFLAFNGFT AILVLYFKQR FDWGPELATT AFLVVGVVAT VVQGGLIGPL VQRFGEWKLT
LFGLGLVIAG CLLIPAVGAA DRAPAIFCSV GILALGTGLV TPSLRSLVSR RLSSEGQGTA
LGSLQALQSL GSFLGPPIAG ISYDLLGPTS PFVLAASLLV IVIALVARSP LTKNLQLST