Gene Avin_33770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_33770 
Symbol 
ID7762272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3452075 
End bp3453301 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content70% 
IMG OID643806238 
Producthypothetical protein 
Protein accessionYP_002800502 
Protein GI226945429 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.756882 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACCG AACACGCCGA CCTCGCAGTC GAGCGCGCTT CGCCATGGCT GCCGATCTGG 
GCCGGCCTGT GTGCCAGCCT GGTCGGCATC GGTCTTGCCC GCTTCGCCTA TACCTCCCTG
GTGTCCGCGC TGATCGAAGC GCACTGGTTC TCCCCATCGG CGGTGATCTA CCTGGGAGCG
GCCAATCTCG CCGGCTACCT CATCGGAGCG CTGGGAGGAC GGCCGCTCGC CGCCCGCTGG
TCGGACACCG GAGCACTGCG CGCCATGATG GTGCTGGTGG CCCTGTCGTT CTTCGCCTGT
GCCGTACCGC TGTCCATGAC CTGGTTTTTC GGCTGGCGGC TGCTGTCCGG CATCGCCGGC
GGGGTGATCA TGGTGCTCGT GGCAAGTACC ATCCTGCCGC ATGTCGCGCC GAACCGGCGC
GGCCTGGCCA GCGGCGCGAT CTTCCTCGGC GTCGGCCTGG GCATCGCCGC CTCGGGCAGC
CTGGTGCCGC TCCTGCTCTC CTTCGGCCTG TCGCAGGCCT GGCTGGGTCT GGGCGGGATA
GCGCTGCTGC TGACGGCCAC CAGTTGGTTC GGCTGGCCTC GGGAAAGCCC GCCTGCGCCG
CTCATCTCCG GCGCATCCGC CGTGCGCGGC TCGACGGGCG TCTACGTGCT GTTCGCCCAA
TATGCGCTGA TGGCCGCCGG GCTGGTGGCG CCGATGATGT TCCTGGTCGA CTACGTCAGC
CGCGGCCTCG GCGCCGGCAG CCACAGCGGC GCGCTGATCT GGACGCTGTA CGGGGTGGGC
GCCATCGTCG GGCCGGTGGT GTATGGGTTC CTGGCCGACC ACTGGGGCGC GCGCTCGGCG
ATTCGCCTGG TGCTGCTGGT CCAGGCCGGT GCTCTCGCGC TGCTGGTGAC GGTCGTCGAC
CTGCGCCTGC TGGCCGTCGC GGCGCTGGCG ATAGGTTCGT TTCCGCCCGG CATAGTGCCG
CTGGCACTGG CCCGCGTGCG GCAGTTGGTT GCCGGACATC ACCAGCAGAG CGCCACCTGG
AGCCGTGCCA CGGTGTCCTT CGCGAGCTTC CAGGCGCTGG CCGGTTATGC CTATTCGGCG
CTGTTCTCCG CCAGCCACGG CCACTACACG CTGCTGTTCT CCGTCGCTGC GGGCGCCATC
GGTCTGGCCC TGGCGCTGGA TCTGCTGCTG GCGTTCAAGA AAGTCTTCCG CCCCGCCACC
GGAGCCGACG CGACACCGGA GGCATAG
 
Protein sequence
MNTEHADLAV ERASPWLPIW AGLCASLVGI GLARFAYTSL VSALIEAHWF SPSAVIYLGA 
ANLAGYLIGA LGGRPLAARW SDTGALRAMM VLVALSFFAC AVPLSMTWFF GWRLLSGIAG
GVIMVLVAST ILPHVAPNRR GLASGAIFLG VGLGIAASGS LVPLLLSFGL SQAWLGLGGI
ALLLTATSWF GWPRESPPAP LISGASAVRG STGVYVLFAQ YALMAAGLVA PMMFLVDYVS
RGLGAGSHSG ALIWTLYGVG AIVGPVVYGF LADHWGARSA IRLVLLVQAG ALALLVTVVD
LRLLAVAALA IGSFPPGIVP LALARVRQLV AGHHQQSATW SRATVSFASF QALAGYAYSA
LFSASHGHYT LLFSVAAGAI GLALALDLLL AFKKVFRPAT GADATPEA