Gene Ava_4752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4752 
Symbol 
ID3679639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5977033 
End bp5978199 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content36% 
IMG OID637720108 
Productmajor facilitator transporter 
Protein accessionYP_325244 
Protein GI75910948 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAATA AAACCATCGA TTCTCCTTTT GCTCCTGGAC TGCCAGCCCT TTATATTGTG 
GCATTTTTAT CAGGAATGTC TTTAGGGTTA TTTAACCCAT TTATTTCCAC TCTGATGAAA
CAAAATAACA TCAATGATAT TGTCATTGGT GCTAACTCTA CACTCTACTT TTTTATCATT
GCTATTGGCA CCCCATTAGT TACCAAAATT CTCAGTAAGA TAGGTTTACG TAAAACCATG
ATGTTAGGGT TTTTGCTCAT GGGAATAACT GCACCTTTAT TTCCCTTCAC AACCCAATTG
TCTGCTTGGT TTTTGATACG TGCCGTGATG GGTTTAGCTT GTTGTTTATA TCTTATCTCT
GGACAAACTG CTATCAACTA TTTCTGTAAT GATAAAAATC GGGGCATTGT TAATGGGTTA
GATGCTTTAT GTTTTAGCTT AGGATTTGGC ATCGGGCCAG TCATGGGAGC AGCTTTTTAT
AATGCTTCTC CTAAAACAAC ATTCCTCTTA GGCAGTGGAT TAATTTTGAG TGGCATTATT
GTCGTATATT TAGGACTACC AGAAAAAGAA ATTAAGTTTC AAATCCCCCG TTTCCAAATT
ATCAAAAAGC TGAAACTACC CTTACATGGT TCCTTTGCCT ATGGTTTTAG TGTCGCCACA
TTAGTATCTC TTTATCCTCT CTATTTACTA GAACAAAATT ATGGTGTTGA GCGCATTGGT
TATATTTTCG GGCTATTTAT TTTAGGAGGA TTAATATCAA CAGTTCCAGT AAGTCATTTA
GCTGATCGCA TAGGTAAAAT TAAAGTATTA AAGTATAGTG TGATTGTCGT GATTATTTCG
GTTATCGGTT TATCTTTTAT CGACGACCCC AATATCACGC CTTTCTTAGC TTTTATTTCT
GGTGTAGGAA TGAGTCCCAT TTTTCCCCTA TCCTTGGCTT TAATTGGGTC AAGACTGGCA
GTTGATGAAT TGTCCTCTGG CAGTGCTTTA TTTACCTCCA TTTATAGTGC GGGATGTACG
GCTGGACCAA TTTTATCGGC AATAGTGATG ACACTCCTCG GAACACAATA TATTTTTGTG
CTAATGATGG TTATTTTTGT TTTATTTTTC CTGAGTTTAA GTAAACAGAA TAAATATAAC
CATTCACTTC TAAGCGTTGA ACGTTAG
 
Protein sequence
MTNKTIDSPF APGLPALYIV AFLSGMSLGL FNPFISTLMK QNNINDIVIG ANSTLYFFII 
AIGTPLVTKI LSKIGLRKTM MLGFLLMGIT APLFPFTTQL SAWFLIRAVM GLACCLYLIS
GQTAINYFCN DKNRGIVNGL DALCFSLGFG IGPVMGAAFY NASPKTTFLL GSGLILSGII
VVYLGLPEKE IKFQIPRFQI IKKLKLPLHG SFAYGFSVAT LVSLYPLYLL EQNYGVERIG
YIFGLFILGG LISTVPVSHL ADRIGKIKVL KYSVIVVIIS VIGLSFIDDP NITPFLAFIS
GVGMSPIFPL SLALIGSRLA VDELSSGSAL FTSIYSAGCT AGPILSAIVM TLLGTQYIFV
LMMVIFVLFF LSLSKQNKYN HSLLSVER