Gene Avin_20360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20360 
Symbol 
ID7760963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2026483 
End bp2027781 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content68% 
IMG OID643804932 
Productmajor facilitator superfamily (MFS) permease 
Protein accessionYP_002799215 
Protein GI226944142 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00895] benzoate transport 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGCGA CAAGAACAAT GGATGTGCGT GAGCTGATCA ACGGCCGTCC CTTCGGCGGC 
TTCCAGAAAC TGGTGGTGTT CTTCTGCTTC GTCATCATCG CCCTGGACGG CTTCGACGTG
GCGGTGATGG GGCTGATCGC GCCCCAGTTG CGCGAGGACT GGGGGGTGAC CCCGCAGGAG
CTCGGGCCGG TGCTGAGCGC CGCGCTGGTC GGCCTGGCCA TCGGCGCGCT GGTCGCCGGT
CCGCTGGCCG ACCGCTACGG GCGCAAGATA GTGCTGGTGT CGAGCGTGCT GTTCTTCGGC
TTCTGGACGC TGGTCACGGC TTTCTCCGGC GATGTCGGCC AATTGGTGAT CTTCCGTTTT
CTCACCGGTC TGGGCCTCGG TGCCGCCATG CCCAATGCCG CCACCCTGAC CGCGGAATTC
GCCCCGGAGC GCAAGCGCGC CTTTCTGGTC ACCGTGGCTT TCTGCGGCTT TTCCTTCGGC
GCGGCGGGCG GTGGTTTCCT GTCCGCCTGG ATGATTCCCA ACCTCGGCTG GCAGAGCGTG
CTGGTCATGG GCGGGGTGTT GCCGCTTCTG GTGGTGCCGC TGATGCTGTG GAAGATGCCG
GAATCGCTGA GCTTTCTGGT GAGCCGGCGG GCCCCGCGGG AGCGCATCCG GCGCATCGTC
GAGCGGATCG CGCCGGGCGT CGCCGACGGC TGCGGGGAGT TCACGATGCC GAGCGCCCCG
CAGCAGTTGG GGGGCGTGCG GCTGGTACTG TCCAGCCACT ACCGCTTCGG CACCCTGATG
TTGTGGGTGG GCTATTTCAC CGTGCTGTTC CTCGTCTACC TGTTCAGCAG TTGGTTGCCG
ACCCTGGTCA GGTCGGGCGG TTACAGCGTT ACCGACGCGG CCATCGTCAC CTCGATGTTC
CAGGTCGGCG GGCCGATCGG CGCGCTCTGC GTCGGTTGGG CGATGGATCG TTTCCGGCCG
CACGGGGTGC TGCTGCTGAC CATGCTGGTG GCCGCGCTGG CCATCGGTGC CATCGCCTGG
GCGGTGGGCT TCTGCCTGAA CGGCGGCAGC GTCGGCATGA ATGCCATGGC TACCTGCTTC
TATCCCACCG AGGCGCGCGC CACCGGTGCC TGCTGGATGA GCGGAGTCGG CCGCTTCGGC
GCCATCCTCA GCGCCTTCGC CGGCGGCCAG ATGATCGCCA TGGGACTGCC GCTCGGCCAG
ATGTTCGTCC TGCTGGCGGT GCCGGCCGTG GTCTTCGGGC TGGCCCTGGC CGCCAAGGGG
CTGAGCCGGC GCGCCATGCC GCACCTGCGG ACCGCCTGA
 
Protein sequence
MNATRTMDVR ELINGRPFGG FQKLVVFFCF VIIALDGFDV AVMGLIAPQL REDWGVTPQE 
LGPVLSAALV GLAIGALVAG PLADRYGRKI VLVSSVLFFG FWTLVTAFSG DVGQLVIFRF
LTGLGLGAAM PNAATLTAEF APERKRAFLV TVAFCGFSFG AAGGGFLSAW MIPNLGWQSV
LVMGGVLPLL VVPLMLWKMP ESLSFLVSRR APRERIRRIV ERIAPGVADG CGEFTMPSAP
QQLGGVRLVL SSHYRFGTLM LWVGYFTVLF LVYLFSSWLP TLVRSGGYSV TDAAIVTSMF
QVGGPIGALC VGWAMDRFRP HGVLLLTMLV AALAIGAIAW AVGFCLNGGS VGMNAMATCF
YPTEARATGA CWMSGVGRFG AILSAFAGGQ MIAMGLPLGQ MFVLLAVPAV VFGLALAAKG
LSRRAMPHLR TA