Gene Avin_51940 details

Gene Information

Locus tag: Avin_51940
Symbol:
ID: 7764031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5298757
End bp: 5300289
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 72%
IMG OID: 643808010
Product: extracellular solute-binding protein
Protein accession: YP_002802244
Protein GI: 226947171
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
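
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch in Python; the constant and function names are illustrative and not part of the record, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
START_BP = 5298757
END_BP = 5300289
GENE_LENGTH_BP = 1533
PROTEIN_LENGTH_AA = 510


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5300289 - 5298757 + 1 gives 1533 bp, and 1533 / 3 - 1 gives 510 aa, which agrees with the listed values.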


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000130332
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTCCGT TCCGCCACCT CTGCCTGCTG TCGCTGCTGG CCTGCCTGGG CGCCTGCGGC 
CCGAACCAGG ACCCGGAGGT GCTGACCCTC GGCGGTCCCT TCGAATTCAC CAGCCAGGAC
CCGGCGCGCG ACGGCTTCGT CTATACCCGC CTGCAGGTGG CCGAGAGCCT GCTGGAGGTG
GACGACGCCG GCCGGCTGCT GCCCGGCCTC GCGCAGGGCT GGGCAGTCGA CGACGACGGA
CTGACCTGGC ATTTGCGCCT GCGCGAGAAG GTGCGCTTCC ACGACGGCCT GCCGCTGGAC
GCCGATGCCG TGGTCCGGGC GCTGGAAATC GCTCGGCGCA AGCCCGGCGT GCTGCGCTCG
GCGCCGATCG TCGAGATCCG CGCCGAGGAC CGGCTGGGCG TGGCCATCCG CCTCGCCAGG
CCCTACAACC CGCTGGGTGC GCTGCTGGCG CACTATTCGA CGGTCATCCT CTCCCCCGCC
TCCTACCGGG ACGGCAGCGA GGTCGGCTGG ATGCAGGGCA CCGGCCCCTA CCGCCTGGAA
GCCTTCGATC CGCCGCACCG CATCCGCGTG ACCCGCTTCG ACGGCTACTG GGGCACCCCG
GCGCGCATCC CGCAAGCGCT CTACCTCACC GGGCACCGCG CCGAGAGCCG CGCCCTGCAG
GTCATGGCCG GGCAGACCGA CATCGTCTAC ACCCTCGACC CCGCCAGCCT GGACCTGCTG
CGCCGGCAGA AGGACATCCG CGTGCATTCC GACGCCATCC CCCGCACCAT CCAGATCAAG
CTCAACGCCG GCCATCCGTT CCTCGCCGAG CGCGATGCCC GGCTGGCCAT GAGCCTGGCC
CTGGACCGCC AGGGCATCGC TAGCCACCTG GTGCGCGTGC CCGGCATGGA AGCCAACCAG
TTGATCCCGC CGGCGCTGGC CGACTGGCAC CTCGACGACC TGCCGCCGAT CCGCCGGGAC
CCCGAACGCG CGCGGCGACT GCTCGCCGAT CTCGGCTGGC GACCGGGGCC GGACGGCATC
CTGCAACGCG CCGGCCAGCG CTTCCGGCTG ACCCTGGTCA CCTACGCCGA CCGCCCCGAA
CTGGCGGTGG TCGCCACGGC CATCCAGGCG CAACTGCGCG AGGTCGGCGT CGCCGTCGCC
GTGGGCATCG TCAACTCCAG CGGCATCCCT TCCGCCCACC ACGACGGCTC GCTGCAACTG
GCCCTGGTGG CGCGCAACTA CGGCAACGTC GCCGATCCCC TGAGCCTGCT GGCCGCCGAT
TACGGCGACG GCGGCAATGG CGACTGGGGC GCGATGGGCT GGCGCAACGA GGAATTGCCG
GCCCTGCTGA GAGGGCTCGA AGCCGAACGC GACCCGGCGC GCTACCGGGC GGATGCCCGG
CGGATCGCGC GCATCCTCGC CGAGGAACTG CCGGTGATCC CGGTGCTCTT CTACACGCAA
CAGACGGCCG TCGCGGCCCG CGTGCGGGAT TTCGGCTTCG ACCCCTACGA GCGCAACTAC
CGCATTTCCC GGATGAGCTT CGCGAGCCCA TGA
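
The record lists a GC content of 72% for this gene. As a rough sketch of how such a figure can be computed from the sequence as printed here (blocks of ten bases separated by spaces), the hypothetical helper below strips the whitespace and counts G and C. The function name is an assumption for illustration, and the exact method the database used is not stated in the record.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCGTCCGT TCCGCCACCT CTGCCTGCTG TCGCTGCTGG CCTGCCTGGG CGCCTGCGGC"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1533 bp sequence to the same function gives the gene-wide value to compare against the listed GC content.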
 
Protein sequence
MRPFRHLCLL SLLACLGACG PNQDPEVLTL GGPFEFTSQD PARDGFVYTR LQVAESLLEV 
DDAGRLLPGL AQGWAVDDDG LTWHLRLREK VRFHDGLPLD ADAVVRALEI ARRKPGVLRS
APIVEIRAED RLGVAIRLAR PYNPLGALLA HYSTVILSPA SYRDGSEVGW MQGTGPYRLE
AFDPPHRIRV TRFDGYWGTP ARIPQALYLT GHRAESRALQ VMAGQTDIVY TLDPASLDLL
RRQKDIRVHS DAIPRTIQIK LNAGHPFLAE RDARLAMSLA LDRQGIASHL VRVPGMEANQ
LIPPALADWH LDDLPPIRRD PERARRLLAD LGWRPGPDGI LQRAGQRFRL TLVTYADRPE
LAVVATAIQA QLREVGVAVA VGIVNSSGIP SAHHDGSLQL ALVARNYGNV ADPLSLLAAD
YGDGGNGDWG AMGWRNEELP ALLRGLEAER DPARYRADAR RIARILAEEL PVIPVLFYTQ
QTAVAARVRD FGFDPYERNY RISRMSFASP
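
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first and last stretches of the gene. It builds the codon table by hand from the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs in the start codons it allows, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, not taken from the record.

```python
from itertools import product

# Standard genetic code in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCGTCCGT TCCGCCACCT CTGCCTGCTG"))      # MRPFRHLCLL
# Last 33 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CGCATTTCCC GGATGAGCTT CGCGAGCCCA TGA"))  # RISRMSFASP
```

Both outputs match the first and last ten residues of the protein sequence above.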