Gene Avin_50610 details


Gene Information

Locus tag: Avin_50610
Symbol:
ID: 7763910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5127790
End bp: 5129100
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 71%
IMG OID: 643807890
Product: General substrate transporter
Protein accession: YP_002802124
Protein GI: 226947051
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
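
The coordinates and lengths in this record agree with one another, which a little arithmetic confirms. The sketch below assumes the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5127790
end_bp = 5129100
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so it adds no residue to the protein.
codons = span // 3
residues = codons - 1

print("span (bp):", span)
print("span matches listed gene length:", span == gene_length_bp)
print("residues match listed protein length:", residues == protein_length_aa)
```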


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.243642
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCCG AGCACTCCAG CAGCGCGCCC GTGGCGCGTC CGTTGACCCG CAGCGACCTC 
AAGACCCTCT CGCTCTCCGC CCTGGGCGGC GCGCTGGAGT TCTACGACTT CATCATCTTC
GTGTTCTTCG CCACGGTGGT CGGCAAGCTG TTCTTCCCCG CCGAGATGCC CGACTGGCTG
CGCCAGTTGC AGACCTTCGG TATCTTCGCC GCCGGCTACC TGGCGCGCCC GCTGGGCGGC
ATCGTCATGG CCCACTTCGG CGACCTGCTC GGGCGCAAGC GCATGTTCAC CCTGAGCATC
TTCATGATGG CCGTGCCGAC CCTGTGCATG GGTCTCCTGC CGACCTACGC GCAGATCGGC
GTCTGGGCGC CGCTGGCGCT GCTCACCCTG CGCGTGGTGC AGGGCGCGGC GATCGGCGGC
GAGGTGCCGG GGGCCTGGGT GTTCGTCGCC GAGCACGCGC CGCAGCGGCA CGTCGGTTTC
GCCTGCAGCA CCCTGACCGC CGGGCTGACC ACCGGCATCC TGCTCGGCTC GCTGACCGCC
AACGCGATCA ACCGGGCGTT CAGCGCCGAG GAACTGGCCG ACTGGGCCTG GCGCCTCCCC
TTCCTGCTCG GCGGGGCCTT CGGCCTGGTT TCGGTCTACC TGCGCCGCTG GCTGCACGAG
ACGCCGGTGT TCGCCGAACT GCAACTGCGC CAGTCGCTGG CCGCCGAACT GCCGCTCAAG
GCGGTGGTGC GCGAGCACCG TCCGGCGGTG CTGCTGTCGA TGCTGCTGAC CTGGGTGCTG
TCGGCCGGCA TCGTGGTGAT CATCCTGATG ACTCCGACCC TGCTGCAGAC GCTGCACGGC
TTCGCCGCGG AAGAGGCCCT GCGGGCCAAC GGCCTGGCCA TTCTCGGCCT GACCCTCGGC
TGCGTGCTGG CCGGCCTCGC GGCGGACCGC TTCGGCGCCG GGCCGACCTT CGTCTGCGGC
GGCCTGCTGC TCCTGGCCAG TTCATCGGCG TTCTACGCCA GCCTCGCCGG CCACCGCGAC
TTGATGCTGC CGCTGTACGC CCTGGCCGGC CTCTGCGTGG GCAGCATCGG CGCCATCCCC
ATGGTGATGG TCAAGGCCTT CCCGGCGGCG GTGCGCTTCT CCGGGCTGTC GTTCTCCTAC
AACCTGGCCT ACGCCATCTG CGGCGGCCTG ACGCCGATCC TGGTCAGCCT GCTGCTGAAG
TGGAGCCCGC TGGGCCCGGC CTATTACGTC GGCGCACTGT GCCTGCTGTT CATCCTGACC
GGCGCCGGCC TGTGGCGGCG CGGAGCCCCC GCGCTGGCGC CGGCCGGCTG A
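
The sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. As a minimal sketch, the hypothetical helper `gc_fraction` below computes the GC fraction of the first 60 bases only; running it on the full 1311 bp sequence would be the way to compare against the GC content listed for the gene.

```python
def gc_fraction(raw_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    clean = "".join(raw_sequence.split()).upper()
    if not clean:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in clean if base in "GC")
    return gc_count / len(clean)


# First line of the gene sequence, copied as printed (blocks of 10 bases).
first_line = "ATGTCCGCCG AGCACTCCAG CAGCGCGCCC GTGGCGCGTC CGTTGACCCG CAGCGACCTC"

clean_length = len("".join(first_line.split()))
fraction = gc_fraction(first_line)
print(f"{clean_length} bases, GC fraction {fraction:.3f}")
```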
 
Protein sequence
MSAEHSSSAP VARPLTRSDL KTLSLSALGG ALEFYDFIIF VFFATVVGKL FFPAEMPDWL 
RQLQTFGIFA AGYLARPLGG IVMAHFGDLL GRKRMFTLSI FMMAVPTLCM GLLPTYAQIG
VWAPLALLTL RVVQGAAIGG EVPGAWVFVA EHAPQRHVGF ACSTLTAGLT TGILLGSLTA
NAINRAFSAE ELADWAWRLP FLLGGAFGLV SVYLRRWLHE TPVFAELQLR QSLAAELPLK
AVVREHRPAV LLSMLLTWVL SAGIVVIILM TPTLLQTLHG FAAEEALRAN GLAILGLTLG
CVLAGLAADR FGAGPTFVCG GLLLLASSSA FYASLAGHRD LMLPLYALAG LCVGSIGAIP
MVMVKAFPAA VRFSGLSFSY NLAYAICGGL TPILVSLLLK WSPLGPAYYV GALCLLFILT
GAGLWRRGAP ALAPAG
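
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own procedure: it builds the codon table from the standard NCBI base ordering, relying on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may serve as starts). The function name `translate` is an assumption, and only the first 60 bases are translated.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(raw_sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(raw_sequence.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record (60 bases, 20 codons).
first_line = "ATGTCCGCCG AGCACTCCAG CAGCGCGCCC GTGGCGCGTC CGTTGACCCG CAGCGACCTC"
peptide = translate(first_line)
print(peptide)
```

The result can be compared with the first 20 residues of the protein sequence above.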