Gene PCC7424_2831 details


Gene Information

Locus tag: PCC7424_2831
Symbol:
ID: 7110634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 3143962
End bp: 3145422
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 44%
IMG OID: 643481077
Product: major facilitator superfamily MFS_1
Protein accession: YP_002378106
Protein GI: 218439777
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
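
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. Both are assumptions about this record's conventions, although they agree with the values shown. The function names `span_length` and `protein_length` are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(3143962, 3145422)
print(gene_len)                  # 1461
print(protein_length(gene_len))  # 486
```

Under these assumptions, 1461 bp corresponds to 487 codons, of which the last is the stop codon, leaving 486 residues.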


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0218624
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTTAT CTCAATCTCA AGAAAAATCC TCTCATTCTC AGAAAATCGC TCATGATAAT 
CAGGATCTAG CCAAATCTAA TCCTAACCCC AGTAAACCTT TATCTTCCAA TAAGTCCCCC
TCAACAGAAG GATTAGGGGC AGTTTTAACT AATCCTCGAT TTGTGGTTTT ATGGACCGGA
CAAATTTTTT CTCAACTGGC GGATAAAATT TATCTGGTGC TAATGATCGC CCTCATTACC
AGTCATTTTC AAGCCCCAGA TCAGCCCATT AGTGGATGGG TATCGGCGAT TATGATCGCT
TTTACCATTC CGGCGGTCTT ATTTGGCTCT TTAGCCGGGG TTTATGTGGA TCGCTGGTCA
AAAAAAGGCG TTTTAGTGAT TTCTAATTTG CTGCGAGGGG CATTAGTTTT AATTATTCCT
CCTTTATTGT GGTTATCGGC TCACGAAGCG ATCGCTATTC CCGTCTCTTG GCTACCGGAA
GGGGCACGAC AATGGCAAGG AGAAGCCCAA AGCGTCTTTT ATCTGCCGGT AGGGTTTTTA
ATGCTCTTAG TGCTAACCTT TCTTGATTCT ACCCTAACTC AATTCTTTGC TCCGGCTGAA
CAAGCCATCA TTCCCTTAAT CGTCAAACGT CGTCGCTTAC TCTCGGCCAA TTCTTTGTTT
ACCACCACGA TGATGGCCAC GTTAATCATC GGATTTGCCA TTGGAGAACC CCTATTACAA
TCGGTATCCC ATTTCGTGGG GTTGATGGGG TTTTCTGAAG ATGTGGGGAA AGCTTTAGTC
GTTGGGGGAT CTTATACCTT TGCCGGGTTA ATGCTATTTT TGCTCAGAAC GAAAGAAAAA
TCAGACATTC CTTCTCACGC AAGACCTCAC GTTTTAGAAG ATATTCGAGA TGGTATTGCT
TATCTTCAAA AAAATCATCG GGTGAGAAAT GCCCTCATTC AACTGATTAT TTTATTTTCT
GTGTTTGCCG CTTTATCAGT CTTAGCGGTA CGATTAGCAG AAACCATTCC GGGCATGAAA
GCCGAACAAT TTGGCATTCT CTTGGCAGTA GGAGGTCTGG GGTTAGCTTG TGGGGCGGGT
ATTGTCGGCA ATTGGGGACA ACGGTTTTCC CATACTCAAT TAAGTATATG GGGGTCGGTG
GGCATGGCCA TATCCTTAGT AGGATTATCG TTTTCCGAGG AGCATCTCTG GTGGGCACTC
TCTAGCACCG TTTTTTTGGG CTTTTTTGGC GCTTTAGTGG GCGTTCCCAT GCAGACCACC
ATTCAAGCAG AAACACCGGC TGATATGCGG GGGAAAGTCT TTGGGTTACA AAATAACGCG
GTCAATATTG CTCTGTCTTT ACCGTTAGCT TTGGTCGGTA TTGCGGAAAC TCTACTCGGA
TTACAAACTG TCTTAATAGG GTTAGCTGTT CTTTCATTAC TCGGAGGAGT ATTAACGGGT
TATATCTCTC GCTTAAGTTA G
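
The GC content reported in the gene information can in principle be recomputed from a sequence printed in this layout. The following is a rough sketch, assuming the sequence is supplied as text in 10-base blocks separated by spaces and newlines, as shown above. The helper name `gc_percent` is hypothetical, and the rounding rule the database uses for its reported percentage is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; pass the full text for the whole gene.
first_line = "ATGCAGTTAT CTCAATCTCA AGAAAAATCC TCTCATTCTC AGAAAATCGC TCATGATAAT"
print(round(gc_percent(first_line), 1))
```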
 
Protein sequence
MQLSQSQEKS SHSQKIAHDN QDLAKSNPNP SKPLSSNKSP STEGLGAVLT NPRFVVLWTG 
QIFSQLADKI YLVLMIALIT SHFQAPDQPI SGWVSAIMIA FTIPAVLFGS LAGVYVDRWS
KKGVLVISNL LRGALVLIIP PLLWLSAHEA IAIPVSWLPE GARQWQGEAQ SVFYLPVGFL
MLLVLTFLDS TLTQFFAPAE QAIIPLIVKR RRLLSANSLF TTTMMATLII GFAIGEPLLQ
SVSHFVGLMG FSEDVGKALV VGGSYTFAGL MLFLLRTKEK SDIPSHARPH VLEDIRDGIA
YLQKNHRVRN ALIQLIILFS VFAALSVLAV RLAETIPGMK AEQFGILLAV GGLGLACGAG
IVGNWGQRFS HTQLSIWGSV GMAISLVGLS FSEEHLWWAL SSTVFLGFFG ALVGVPMQTT
IQAETPADMR GKVFGLQNNA VNIALSLPLA LVGIAETLLG LQTVLIGLAV LSLLGGVLTG
YISRLS
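
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The following is a minimal sketch, not the database's own method. It builds the codon table in the NCBI base order (T, C, A, G), relying on the fact that translation table 11 uses the same amino acid assignments as the standard code and differs in the start codons it permits. Alternative start codons are not handled here, which is acceptable for this gene only because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGCAGTTAT CTCAATCTCA AGAAAAATCC"))  # MQLSQSQEKS
print(translate("TATATCTCTC GCTTAAGTTA G"))           # YISRLS
```

The two outputs match the first ten and last six residues of the protein sequence. The final line of the gene sequence starts in frame because the 24 preceding lines hold 1440 bases, a multiple of 3.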