Gene Ava_3989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3989 
Symbol 
ID3680460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4961091 
End bp4962323 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content37% 
IMG OID637719341 
Productmajor facilitator transporter 
Protein accessionYP_324489 
Protein GI75910193 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCAA ACCACGTAAT TTATCAATTA TCCCCCGGTT TTATGACAGA ATTTACTTCA 
ACGAAATCAA ATTCCCCTTT TTCTACTGGA TTACCAGCTT TGTATAGCAT AGCTTTTTTG
TCTGGTATTT CTATAGGGCT ATTTAATCCC TTTATCTCAA CATTAATGGC GCAACATCAA
GTTGATGATT TATGGATAGG AGCAAATTCT ACGGTGTATT TTCTAGTCAT AGCGTTGGGA
ACACCGTTAG TAGTAAAAGT ATTACCCAAG TTAGGGCTTC GTAAAACGAT GATGCTTGGC
TTGACAATGA TGGGTATTAG CGCTCCTTTG TTTACCATGA CTACATCAAT GCCTTTGTGG
TTCATTATAC GTGCTGTTAT GGGCATTGCT TGTTGTTTAT ATTTAGTCAG TGGAAACACT
GCATTGAATC ATTTTTGTCA TGAAGGTAAT CGAGCGATAG TTAACGGTTT GAATGCTCTA
GCTTTTACTT TTGGATTTGG TATTGGCCCT GTAATTGGTT CTGCTTTTTA TAATGTTTCA
CCAAAACTTT CATTTTTGTT GGGTAGTGCT TTAATTTTTA GCGGTGTAAT TGTAGTTTGG
ATAGCTTTAC CAGATAAGGC AGTTGTTTTT CAACAATCTT CACGTTCCAG AATTTTTAAC
AAACTCAAAC TTCCCCTTCA GGGCGCATTT GCCTATGGTT TTGCCGAATC AACGCTAGTT
TCTTTATATC CGGTTTATCT GCTACGACAA AATTACAATA TAGAGCAGAT CGGCTATACC
TTCGCTGTAT TTGTAGTTGG CGGCTTGCTC TCTACTGTTC CCGTTACTCA CATAGCAGAC
AAATTCGGCA GACTCAAAGT TCTGTTTATG AGTGTGTTTA TCGTCATATT GTCGTTTTTA
TCTCTTTCAT TGATTCAAAA CTCTACGGCT ACCCAGATAT TTGCATTTAT TGCTGGAGCT
AGTATTAGTC CAATTTTTCC CTTAGCAATG GCATTGATTG GTGCAAAACT CTCTAGAAAT
GAACTATCTT CTGGCAGTGC TTTGTTCACG GCTATATATA GTTTCGGATG TACTGCTGGG
CCGATCGCTT CATCTTTAGC TATCAAAGTT TTTGGGGATA GTTATATATT TAGTTTGACA
ATAATTATCT TTGCCATATT TTTGGTTTAC CTGAGTATAC CAAATAAAAA TTTTCGTACC
TATTTACTTA ATGTGGCACG GAAAATACAT TGA
 
Protein sequence
MHSNHVIYQL SPGFMTEFTS TKSNSPFSTG LPALYSIAFL SGISIGLFNP FISTLMAQHQ 
VDDLWIGANS TVYFLVIALG TPLVVKVLPK LGLRKTMMLG LTMMGISAPL FTMTTSMPLW
FIIRAVMGIA CCLYLVSGNT ALNHFCHEGN RAIVNGLNAL AFTFGFGIGP VIGSAFYNVS
PKLSFLLGSA LIFSGVIVVW IALPDKAVVF QQSSRSRIFN KLKLPLQGAF AYGFAESTLV
SLYPVYLLRQ NYNIEQIGYT FAVFVVGGLL STVPVTHIAD KFGRLKVLFM SVFIVILSFL
SLSLIQNSTA TQIFAFIAGA SISPIFPLAM ALIGAKLSRN ELSSGSALFT AIYSFGCTAG
PIASSLAIKV FGDSYIFSLT IIIFAIFLVY LSIPNKNFRT YLLNVARKIH