Gene Avin_20840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20840 
Symbol 
ID7761009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2077223 
End bp2078434 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content70% 
IMG OID643804979 
Productmultidrug/chloramphenicol efflux transporter, major facilitator superfamily MFS_1 
Protein accessionYP_002799260 
Protein GI226944187 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.290056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC GCAACGTTGT CCATCGCGGT GTCTGGGCGC TGGCGGTCAC GGCGTTCGCC 
ATCGGCGTGG CCGAGTTCAT CGTGGTCGGC GTGCTGCCGG CCATCGCCGA GGACCTCGGC
GTACCGCTGG CACGCGCCGG CGGACTGGTG GGGTTGTACG CGCTGGCATT GGCCATCGGT
ACGCCGCTGG TGGTACTGGG ATTGGCCCGG CTGCCGCGCA AGCCTGTGCT GCTGACCTTG
GTGGCTGTGT TTCTCGCCGG CAACCTGCTG TCGGCGCTCT CGACCAGCTA TGCGGTGCTG
CTGGCCGGGC GCATCTTGAC GGCGGTGGCT CACGGCAGCT TCTTCGCCAT CGGCGCGACA
CTGGCAGCCC GGCTGGCCCC CGAGGGGCAG GCCAGCCGGG CGATCGCATT GATGTTCGCG
GGCCTCACGC TGGCGATGGT GATCGGCGTG CCGCTGGGTA GCCTGATCGG CAACGGCCTG
GGCTGGCGGC TGCCATTCTT CGCCGTCGTG CTGCTGGCCG CGCTGGGCTG GCTGGCGACC
GCACTGTGGG TGCCGGCCCT GCCGGCGCAG GCGGCGGGGC GCGCCGGTAG CCAACTGGCG
GCGCTGGCGC GACCCGAGAT CCTGACGATG ATGAGCATCA CCATCCTCGG CTTCGGTGCC
AGCTTCGCGG CCTTCACTTT CATCACGCCG ATCCTGACCG CCATCACCGG CTTCTCGGCC
CGGATCTCCA GCCTGCTGCT GGTGGTGTTC GGCGCGGCGA CGCTGGTGGG CAATCTCATG
GGCGGGCGCT GGGCCGCCAG CCTGGGCTGG CCGGTAGCGC TGCGGCGCAT GCTGGTGGGC
CTGCTGGTCG TACTCGTGGC GATCGCATTG CTGATGCCCT ACCGGACACC GATGGTGGCG
CTGCTGTTCG TCTGGGGCCT GCTGGCCTTC GGTATGTCGC CGGGCTTTCA GGCCGGCATG
CTGGCTACCG CCGAACGCTG GACGCCGCGT GCGGTGGACT TCGCTTCGGC GTTGAACATC
TCGGCATTCA ACCTGGGTAT CACGCTGGGG GAGACGTTGG GTAGCGTACT GGTGGTGCGA
GACGACATGG CGCTGACGCC TTGGGCGGGT GTCGGGCTGG CGTTGATCGC GCAGTTGCCG
CTGGCGTGGC TGGCACAGCG GTCGTCCGGC GCCGGAACGG TACCGGCCGC TGGTGGATGG
GAGGGGCGAT GA
 
Protein sequence
MSQRNVVHRG VWALAVTAFA IGVAEFIVVG VLPAIAEDLG VPLARAGGLV GLYALALAIG 
TPLVVLGLAR LPRKPVLLTL VAVFLAGNLL SALSTSYAVL LAGRILTAVA HGSFFAIGAT
LAARLAPEGQ ASRAIALMFA GLTLAMVIGV PLGSLIGNGL GWRLPFFAVV LLAALGWLAT
ALWVPALPAQ AAGRAGSQLA ALARPEILTM MSITILGFGA SFAAFTFITP ILTAITGFSA
RISSLLLVVF GAATLVGNLM GGRWAASLGW PVALRRMLVG LLVVLVAIAL LMPYRTPMVA
LLFVWGLLAF GMSPGFQAGM LATAERWTPR AVDFASALNI SAFNLGITLG ETLGSVLVVR
DDMALTPWAG VGLALIAQLP LAWLAQRSSG AGTVPAAGGW EGR