Gene Avin_31080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31080 
Symbol 
ID7762008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3213431 
End bp3214741 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content69% 
IMG OID643805983 
Producthypothetical protein 
Protein accessionYP_002800247 
Protein GI226945174 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACGA TAACCCTGCA GGTTTACCTG GACGGCCAAT GGCACGATGC GATGCGGGTG 
AGCTTCGATG CTCCCGAAGA CGGGTTGCGC AGTCGTTGCA GCGCTCGCTA CGAGGCCGAC
TATCTCGTCG CCCACCTCGA CGAACTGGGC ACGCCGAAGG CAGCGGCCGT GAGCGCAGTT
TTTTCCCTCG GCTGGGAAGA CTATCGCGGC ATCGCTCCGG CCTTCCTGCA CGACATCGTT
CCCGCCGGCG CCGCGCGCCG GCACATCCTG GCACGGATGG CCGTGCCGCT CGGCGCACCG
GAGGAGTTCT TCCTGCTGCA GCACTGCACC ATGGCGCCCG TCGGTAACCT GCGCGTCAAG
GAGTCCGTCG CCGCCCCGCG CGAGCCGGTC GGCTTCCCTC GCGAGGAGGT GATCCGCCGC
GATATCCGTT TTCTCGACCA TGCCTACGAG CGCGGTGCAG CCATCGGCGG CGCCACCGGG
GCCGGCGGCG AGGCGCCCAA GCTGCTGCTC GCCGAAGACG CCGCCGGCAA CCTCCATCCG
GACGCCGGCC TGCCCGATGC CGAGGTGTGC CGGCACTGGT TCGTCAAGTT CCCGTGCAAT
TCGGGGACGG AAACCGACCG GGTCATCCTG CGCAGCGAAT ACTGCTATTA CCGCGCGCTG
AACCGGCTGG GAATCGAGAC GATCTCCGCC GAAGGGCTGG CCTACGAGGA AGCGGAAAAG
CCCAGCCTGT GGATGCGGCG CTTCGACCGC CGGATCGGCC CGAACGGCGT CGAGCGCATC
GCCGTCGAGT CCGCCTATTC CCTGTGCGGC GTGACCCGGC CGGGCAGCCG CATGGAGCAT
GTCGAGGTCG TCGCCCGCCT GGCGGAAACC TGGGACGCCG CCGGGCAGGC TGCGGAAATT
CCCGCCATGG TCGCCGAGTA TCTGCGCCGC GACCTGCTCA ACCAGATCCT CGGCAACACC
GACAACCACG GGCGTAACCT TTCCATCCTG CGCACGCGCG AGCGCATCGA CCTGGCGCCG
ATCTACGACC TCGCACCGAT GGCGATGGAC CCCGAAGGCG TGGTGCGCAC GACTCGCTGG
CCCGAGGGCA TCGAGCGGTT CGACGGCACC GACTGGCGGG CCGCCTGCAA CGCCCTGTCG
CGCTGGAGCG ACCCCGAACT CCTGTTCGAG CGCTTGCGCG ACGACGCCCG CCAACTGCTG
GCGCTGCCGG ACCTGCTGGC CGAACTGAGC CTGCCCGAAC AGACCTGGAA GGCACCGACC
ATCCCGCTGG GCCGCCTGGA GGTCACCTTG CGCCTCTGGG GACTGCTGTG A
 
Protein sequence
METITLQVYL DGQWHDAMRV SFDAPEDGLR SRCSARYEAD YLVAHLDELG TPKAAAVSAV 
FSLGWEDYRG IAPAFLHDIV PAGAARRHIL ARMAVPLGAP EEFFLLQHCT MAPVGNLRVK
ESVAAPREPV GFPREEVIRR DIRFLDHAYE RGAAIGGATG AGGEAPKLLL AEDAAGNLHP
DAGLPDAEVC RHWFVKFPCN SGTETDRVIL RSEYCYYRAL NRLGIETISA EGLAYEEAEK
PSLWMRRFDR RIGPNGVERI AVESAYSLCG VTRPGSRMEH VEVVARLAET WDAAGQAAEI
PAMVAEYLRR DLLNQILGNT DNHGRNLSIL RTRERIDLAP IYDLAPMAMD PEGVVRTTRW
PEGIERFDGT DWRAACNALS RWSDPELLFE RLRDDARQLL ALPDLLAELS LPEQTWKAPT
IPLGRLEVTL RLWGLL