Gene Avin_50220 details


Gene Information

Locus tag: Avin_50220
Symbol:
ID: 7763873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5090803
End bp: 5091942
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 72%
IMG OID: 643807853
Product: LacI regulatory protein
Protein accession: YP_002802087
Protein GI: 226947014
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
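
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the "Start bp" and "End bp" fields of the record.
START_BP = 5090803
END_BP = 5091942


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1140 379
```

Under these assumptions the result agrees with the listed Gene Length (1140 bp) and Protein Length (379 aa).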


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTATTGG GGAGGGAATT GGGCTGCTAT AGTCCGGCCG CGTTATATGT CAGAACATCT 
CAAGGACAGC AGAAGAGCTT TCATTTCATG CCTGAACCCG TCAGCCCTCC GCCCCGGCGT
CCTACCCTGC AGGATGTCGC CCGCGCCGCC GGCGTGAGCC TGTCGACCGT CGACCGGGTG
ATCAACCTGC GCGCCAACGT GCGCGCCGAC ACCGCGCGGC GCATCGCCGA GGCGGCGGCG
CGGCTGGGCT TCCATGGCCG CGGCGTCATC GAGCAGCGCC TGCTGGAACA GCGCCGGACC
GTGCGCCTCG GCTTCCTGCT GTGGAAACGC AACACGGCCT TCTACCGCGG GCTGGCCGAC
GGACTGGCCG AAGCCGCCGC GGCCTGCGTC CGGGCCCAGG TGCGGGTGAC GATCGGCTAC
CTGGACAACC TGGACCCGGA GGCGACCGCG GCGCGCATGC TGGCCCTGCG CGGCGAGGTG
GACGCGCTGG GCATAGTCGC CGCCGACCAC CCGGCCGTGC GCGAGGCGGC GACCGCCCTG
CGCGCGGCGA GCCTGCCGGT GGCGGCGCTG GTTTCCGAGC TGGCCGCCTC GACCGGGGCG
GGCTACGTCG GCCTGGACAA CCGGCGCGTG GGCCGCACGG CCGCCTGGTT CGTCGCGCAG
CTCGCCCGCC AGCCGGGGGC CGTGGCGCTC ATGGTCGGCA GCCAGCGCTT CCAGTGCCAG
GAACTCTGCG AACTGAGCTT CCGCAGTTTC CTGCGCGAGC ACTCCGAGGA CTGGGAACTG
CCGGCGCCGC GCCTGACCCT GGAGGACGAA ACCTTCGCCT ACGAGAACAC CCTGGATCTG
CTCAGCGGCC AGCCCGACCT GGCCGGGCTC TACGTCGGCG GCGGCGGGAT CGAGGGCGTC
CTGCGCGCCC TGCGCGAGCA GCGCGGCGCC CGGCCGGTGG TGGTCTGCCA CGACCTGACG
CCGGTGACGC GGGCCGCCCT GAACGATGGG ATCGTGCAGG TCGTGCTCTC GCATCCGCTG
GCCGAGCTGG CGCGCTCGGC GGTGGACCTG CTGGTCGAGG CCGCGACAGG CCGGGAAGCG
ACGTCCGCGC TGCCCAAGCG CATCCTGCCC ATCCAGATCG ATATTCCCGA AAGCGTCTGA
 
Protein sequence
MLLGRELGCY SPAALYVRTS QGQQKSFHFM PEPVSPPPRR PTLQDVARAA GVSLSTVDRV 
INLRANVRAD TARRIAEAAA RLGFHGRGVI EQRLLEQRRT VRLGFLLWKR NTAFYRGLAD
GLAEAAAACV RAQVRVTIGY LDNLDPEATA ARMLALRGEV DALGIVAADH PAVREAATAL
RAASLPVAAL VSELAASTGA GYVGLDNRRV GRTAAWFVAQ LARQPGAVAL MVGSQRFQCQ
ELCELSFRSF LREHSEDWEL PAPRLTLEDE TFAYENTLDL LSGQPDLAGL YVGGGGIEGV
LRALREQRGA RPVVVCHDLT PVTRAALNDG IVQVVLSHPL AELARSAVDL LVEAATGREA
TSALPKRILP IQIDIPESV
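
The sequence blocks above are printed in groups of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the method used to produce this record, of how the GC content and the translation could be recomputed. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is written out by hand for NCBI translation table 11, and the sketch does not handle the alternative start codons that table 11 allows (this gene begins with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the display spaces and line breaks, then upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their display spacing.
first_block = "ATGCTATTGG GGAGGGAATT GGGCTGCTAT"
print(translate(first_block))  # MLLGRELGCY
```

The ten residues printed here match the start of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the GC content and Protein sequence fields of the record.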