Gene Avin_31240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31240 
Symbol 
ID7762024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3228828 
End bp3230168 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID643805999 
ProductMajor facilitator superfamily protein 
Protein accessionYP_002800263 
Protein GI226945190 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00895] benzoate transport 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACACCG ACTCGCCCGT AGAAGTCAAA CACTGGATAG ACGGCCAGCC CCTGTGCCGA 
CGCCAGTGGC TGATTCTGGC CCTGTGCTTC TTCATCGTCC TGTTCGACGG CATGGATGTG
GCCGTGATGG GCTTCATCGC GCCGGCGCTG ATGCAGGACT GGCAATTGTC CAAGGCGGCC
TTCGGCCCGG TCATGAGCGC CACCATGGTC GGCCTGGCGC TGGGCGCGCT GGTGGCCGGC
CCCTACGCCG ACCGTTTCGG ACGCAGGAAG GTCCTGCTGG GCGCCGTGAC GGTCTTCGGC
ATACTCAGCC TGGCCTCCGC CTTCGCGCGA AATCCCTACG AACTGGCGAT CCTGCGCTTC
CTGACCGGTG TCGGCCTGGG CGCAGCCATG CCCAACACCA CGACGCTGCT TTCCGAGTAC
CTGCCCGAGC GCTACCGTTC GCTGCTGATC ACGGTCATGT TCACCGGCTT CAACCTGGGC
TCGGGCGGCG CCGGCTTCGT CGCGGCCTGG GTGATTCCCC AGTACGGCTG GCATGGCGTG
CTGCTGGTCG GGGGACTGCT GCCCCTCGCC CTGCTGCCGC TGCTCTGGCT ATTCCTGCCG
GAATCGGCAC GCTTTCTGGT CGCGAACAAC GCACCGGCCG AGCGCATCGC GAAATTGCTC
AACAAGCTCG GCGGACAATT CGGCCCGGCG ACCCGCTTCG TCACCGCCGA GCAACCCGTC
CGGCACAAGG CCCCGGTACG CCGGCTGTTT TCGGAACAAT ACCGCCTGGG CACCTTCGCC
CTGTGGGTCA CCTACTTCAT GGGCCTGCTG GTGATCTACC TGACGATGGG CTGGCTGCCG
ACGCTGATCC GCGAGAGCGG CATCTCCATC GAGCGCGCGG CCACCGTCAC CGGCCTGTTC
CAGATCGGCG GGACCGTCGG GGCGATCGCG GTCGGCTGGA TCATGGACCG CCACAGCCCC
AACCGCGTCA TCGCGACGGC CTATGCCCTG GGCGGCGCGT TCATCCTGCT GCTCGGCGTC
CTGGGACTGG AATCGGAACT GCTGACCGTC GGCGTGCTCG CCGCCGGGTT CTGCATCAGC
GGCGCGCAAA CCGGGCTCAA CGCCTTCGCC CCCGGCTATT ACCCCACCGA ATCCCGCGCC
ACCGGGGTGA GCTGGATGCT GGGCATCGGC CGCTTCGGCG CCATCTTCGG CGCCATGATC
GGCGGCCTGA TCCTGAGTCT GGGCCTGGGC TTCGGCCTGA TCTTCGGCGC CCTGGCGATC
CCCGCCTTCA TCGCGGCGCT GGCGATCCTG CTGAACGGCT ACGCCAGCCG CCGCCTGCCG
CGGACCGCGC TGGCGCTCTG A
 
Protein sequence
MHTDSPVEVK HWIDGQPLCR RQWLILALCF FIVLFDGMDV AVMGFIAPAL MQDWQLSKAA 
FGPVMSATMV GLALGALVAG PYADRFGRRK VLLGAVTVFG ILSLASAFAR NPYELAILRF
LTGVGLGAAM PNTTTLLSEY LPERYRSLLI TVMFTGFNLG SGGAGFVAAW VIPQYGWHGV
LLVGGLLPLA LLPLLWLFLP ESARFLVANN APAERIAKLL NKLGGQFGPA TRFVTAEQPV
RHKAPVRRLF SEQYRLGTFA LWVTYFMGLL VIYLTMGWLP TLIRESGISI ERAATVTGLF
QIGGTVGAIA VGWIMDRHSP NRVIATAYAL GGAFILLLGV LGLESELLTV GVLAAGFCIS
GAQTGLNAFA PGYYPTESRA TGVSWMLGIG RFGAIFGAMI GGLILSLGLG FGLIFGALAI
PAFIAALAIL LNGYASRRLP RTALAL