Gene Avin_39050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_39050 
Symbol 
ID7762794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3953941 
End bp3955161 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content71% 
IMG OID643806768 
Producthypothetical protein 
Protein accessionYP_002801020 
Protein GI226945947 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.121024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCA GCCTGCATCA GGCCGGGGTG CGCTTTCTGC GCCTGCTGGG CCGGCATGCC 
GAGCCGATCA TGGACGCCTA CCTGGCCGGC TCGGTGACGG ATCAGGCGCT GGAGCCGGCG
GTCGAGGAGC GGCTGGTCAG GGACGGTATC CTCTACCGTC CCGAGCCCGG CGCCGACCTG
CACCTGCGCC GGGCGGTGCG CGCGCTGCTG GAGGAGGCCC TGCGCGACGA CCGCAACCGG
CAGATCGACG CCAACGCCGG CGCCGCCCTG GCCACCTTCA AGACCCTCGC CGCCCACTAC
AAGGAGGCGC GCCACCAGGG CGACTACGCG GCCGCCGACG CTTACCTGGG CGAACTGCGC
GAGCACGTCT ACGCCTTCGG CGAGACCCTG GGCCACGGCA TTCGCGTGCT GTGGAGCCGG
ATCAACAACG AGTTCGGCTA CGTCGGCACC CTCAACGCCA AGATCCGCGA GAACGAGTTG
GCCCAGTCCC AGGTGAGCGA ACTGCTCGCC GGGCTGGAGC TGATCAGCTT CGAGGAACTG
GCCGAGACCG CCGGCGACCT GCGCGAGCTG CGCCGCCTGC TGGTCACCAG CCTGCAGCGC
ACGGTCAGCG CCTGCTCCCA GGAGCTGAGC GTGGTCCAGG GCCGCCTGCT CGAACTGCTC
GGCCGCTTCC GCCAGATCCG CGGCCGCACC CGTCTGCTCA AGGGCTGGCT GCTGCACATG
GAGCAGCAGC CGGACTACCG GGTGGGCAAC CACGCCGCCC AGCCGCAGGT CCCGCAACTG
TTCAACCAGG CCCCGGCGAT CCTCGCTCCG GCCGCGGTGG ACGTCCACAA CCCGTCGCAG
GAGGAGGTCC TGCTCGCCCT GGTGGCCCAG GCGCGCAGTC TGCAGCCGGC CGAGCGCCTG
GGCCAGGCGC CGGGCGAGGC CGGCGAATTC GTGCTCGGCG CGCCCGAGGA CTTCGAGGTC
GTCGCCAACC CGATCCGCGC GGCCGTCGAG GCCTACTTCT GCCGGATCAT CGACGGTGGC
GAACGGCTTT CGGCCCTGGA GTACCGGGCG CAGCACGAAC TGCCCTGGGA TGCGGAAAGC
TGGCTGTACC AGGTGATCGG CGGCTACGAG GGGCTGCCGG AGGAGCAGAA GCGCCACTTC
GAGCTGGACC CCATCGGCGA GCCGCATCCG GTCTACTCGG GCAATTTCAT CGTGCGGGAC
GTCAGGCTGT GGCTGGCCTG A
 
Protein sequence
MSGSLHQAGV RFLRLLGRHA EPIMDAYLAG SVTDQALEPA VEERLVRDGI LYRPEPGADL 
HLRRAVRALL EEALRDDRNR QIDANAGAAL ATFKTLAAHY KEARHQGDYA AADAYLGELR
EHVYAFGETL GHGIRVLWSR INNEFGYVGT LNAKIRENEL AQSQVSELLA GLELISFEEL
AETAGDLREL RRLLVTSLQR TVSACSQELS VVQGRLLELL GRFRQIRGRT RLLKGWLLHM
EQQPDYRVGN HAAQPQVPQL FNQAPAILAP AAVDVHNPSQ EEVLLALVAQ ARSLQPAERL
GQAPGEAGEF VLGAPEDFEV VANPIRAAVE AYFCRIIDGG ERLSALEYRA QHELPWDAES
WLYQVIGGYE GLPEEQKRHF ELDPIGEPHP VYSGNFIVRD VRLWLA