Gene Avin_35500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_35500 
Symbol 
ID7762445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3623606 
End bp3624736 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content67% 
IMG OID643806419 
ProductMajor facilitator superfamily transporter 
Protein accessionYP_002800674 
Protein GI226945601 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTT GCGTGGCACT GCTGATCGCC TCCGAATTCA TGCCGGTCAG CCTGCTGACA 
CCGATCGCCG AAGACCTGCA GGCCACCCAG GGCCTGGCGG GACAGGCCAT CTCGGTGTCC
GGCCTCTTCG CGGTGCTGTC CAGCCTGTTC ATCGCACCTG TCGCCGGGCG CTTCGATCGC
CGGCACGTCC TGATGGGCAT GACCCTGCTG ATGCTGCTGT CGCTGATCAT GATCGCCCTG
GCGCCCGACT TTTCCCTGCT GATGACGGCC CGCGCCCTGC TGGGACTGGC GATCGGCGGT
TTCTGGTCGC TGGCCACGGC GACCGTCATC CGGCTGGTGC CTGCCGAGCG TGTACCGAAG
GCGCTTGGCA CCATCTACAT GGGCAATGCC ATCGCCACGA CGTTCGCCGC TCCGATCGGG
GCCTACGTGG GAGGACTCCT CGGATGGCGA GTGGTGTTCG GCGCTCTCGT TCCGCTGGTG
ATCGTCAACC TGCTGTGGCA GGCCAGGAGC CTTCCCGCGA TGCCGCCGCA GGCGACCATT
CCGACGATGC GCCTGTTCGA GCTACTCAAA CGCAGGCATG TCGCCCTGGG CATGATCTCG
ACCATGCTGA CGTTCGCGGG TGCGTTCGGC GCCTTCACCT ATTTCCGCCC CTTCCTGGAG
GCCGAAACGG GCGTGAGCCC GAGCCAGTTG CCGCTCCTGC TGCTCGGCCT TGGCATGGCC
GGATTCGCGG GAACCCACTT CGCCAGCGCC ATGCTCCATC GGCAGCGCCT CTACCCGCTC
CTGCGCTACC TCCCCCTGGC CCTCGCCGTC GTCACCCTGG CAATAATGGA GGCCGGGCAT
CTCTTCTGGG CGGTGGCGGC CATGATGGTC GGCTGGGGAG CGCTGAACGC TGCCATACCG
GTGTGCTGGT CCACCTGGCT GGCCAAGGAA ATCGATGACG AGCCGGAAAG CGGCGGTGGA
CTGCTGGTCG CCTCGATCCA GTTGGCGATC ATGATGGGCG GCGCACTGGG CGGCCAGCTC
CTGGACCGTG CCAATGCTTC CGCTCCCCTG GTGGGCGGCG CGATCCTGCT GGTCCTTTCC
GCTCTTGTCA TCGGCAACGG CACCCGCATC AGGAAACCGG CACGGGCCTG A
 
Protein sequence
MALCVALLIA SEFMPVSLLT PIAEDLQATQ GLAGQAISVS GLFAVLSSLF IAPVAGRFDR 
RHVLMGMTLL MLLSLIMIAL APDFSLLMTA RALLGLAIGG FWSLATATVI RLVPAERVPK
ALGTIYMGNA IATTFAAPIG AYVGGLLGWR VVFGALVPLV IVNLLWQARS LPAMPPQATI
PTMRLFELLK RRHVALGMIS TMLTFAGAFG AFTYFRPFLE AETGVSPSQL PLLLLGLGMA
GFAGTHFASA MLHRQRLYPL LRYLPLALAV VTLAIMEAGH LFWAVAAMMV GWGALNAAIP
VCWSTWLAKE IDDEPESGGG LLVASIQLAI MMGGALGGQL LDRANASAPL VGGAILLVLS
ALVIGNGTRI RKPARA