Gene Avin_40880 details

Gene Information

Locus tag: Avin_40880
Symbol:
ID: 7762973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4123650
End bp: 4125512
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 70%
IMG OID: 643806947
Product: hypothetical protein
Protein accession: YP_002801198
Protein GI: 226946125
COG category:
COG ID:
TIGRFAM ID:
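
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original record: it assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(4123650, 4125512)
    print(length)                  # 1863
    print(protein_length(length))  # 620
```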


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.276542
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAAAC GCCTGCTTCC CCTTCTGCTG GGCGGCGCCT GCGCCACGCT CCAGGCCGAT 
CCGCAACCAT CGGCGGCGCG TCTGCACGAG TTGGCCGCGG AACCCTTCTG GCTCGCTCTC
GGCCACTACC AGCGCGACCG ATTGGGCGGC TGGCGCAGCC ACGTCGACGA CGGCGGATTC
TTCCTGGCCG AGCGGGGCGA ACGCGATCCG GCGGCGGAAC TCGCCGCCAC CCTGGTCGCC
CTCTACCGGG ACCCGGCCCT CGCCGACCGC CATCCGCAGT GCCGCTTCCC GGCCCGCACC
CGCTGGCTGC GCGAGCGCCT GGCGCTGGAC GACCTGCCGC AGCCGCAGTG CCGCGAATAC
ACCGACTGGT ACGCCGACAT CGCCCCGCAC AGCACGGTAC TGGTGTTCCC CGCCGCCTAC
CTGAACAGCC CCTCGTCGAT GTTCGGCCAC ACCCTGCTAC GCATCGATCC GCCCGGCATC
GACAGCCAGG GCAGCCCGCT GCTCAGCTAT GCGCTGAACT TCGGCGCCTA CATCGAGGGC
AGCGACAACA GCATCCTCTA TGCCTGGAAG GGCCTGATGG GCGGTTATCC CGGTCTCTTC
GCCCTGCTGC CCTACCGGGA AAAGCTCGCC GAATACAGCC GCCTGGAAAA TCGCGATCTC
TGGGAATACC GCCTGAACCT CAGCGCCGAG GAAACCGGGC GGATGGTCGA GCACGTCTGG
GAACTGCGGC AGATCCAGTT CGACTACTAC TTCTTCGACG AAAACTGCTC CTACCGCCTG
CTCGAGTTGC TGGAAATCGC CCGTCCGGGC CTGGAGCTGA CCGACCGTTT CCCGCTCACC
GCCATCCCCG CCGACACCGT GCGCGCAGTG GAGCAGGCCG GGCTGATCGA GCGCAGCGCC
TACCGTCCCT CGCGGGAGAA GGAACTGCTC GCGCGCGCCG ACCCGCTGAG CCGCCCCGAG
CGCGACTGGA CGCGGCGCCT GGCCGGGGAC AGCACCGCGC TCGACGCTGC GGCCTTCCAG
GCGTTGCCGG CCGAACGCCG GGCACTGGTA CAGGACGCCG CCTACCGGCT GATCCGCTAC
CGCAAGAACG GCGAGGAACG CGACGACACC AGCGCCCGCG ACAGCTACCG CCTGCTGCAG
GCGATCAACC GCGCGCCGCC GCCGCGCCTC GCCGTGGAGC GCCCGGTGCC GCCGGAAAAC
GGCCACCAGT CGCGCACCTG GCAGCTCGCC CTCGGCAGCC GCGAAGACCG GAGCTTCGCC
GAATACGGCC TGCGCATGGC CTACCATGAC CTGGCGGACA ACCTGGATGG CTTCCCGCTC
GGCGCGCAGA TCGAGATCGC CCAGCTCAAG CTGCGCCAGT ACGAGGGCGA CCACTGGCAA
CTGCAGCGGC TGGATGTGGC CAACATCCGC TCGCTGACCC CGCGCAGCAC TCTGCTCTCG
CCCCTGTCCT GGCAGGTCGG CGGCGGTCTG GAGCGGGTCG TCGGCAAGGG ACGTCACGAC
GACGAGCAAC TGGTCGGCCA TCTGAACGGC GGCGTCGGCG CCACCTGGCA CCTCGCCGAC
GACCTGCTCG GCTTCGCCCT GGCCACCGCC CGGCTGGAGC GCAACGGGGA CTTCGCCGCC
TTCCTCAGCC CGGCGGCCGG CTTCGATGCC GGCCTGCTGT GGCGCAACCG CCTGGGCAAC
CTCGGCCTCG AGGCCCGCGC CGACTACTTC CACAACGGCG AGGTACGCCG CGAGCTGAGC
CTGGTCCAGC AATGGGAAAT CGCCGGCAAC CTCGGCCTGC GCCTGGCCGC CCGGCGCGAA
TTCAGCCAGC AGGGCGCGCC GGCCAGCGAA CTGAGCCTGC AGTTGCGCTG GTATCACTAT
TGA
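
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before it can be checked against the Gene Length and GC content fields. The sketch below is an illustration only, with a hypothetical helper name; it assumes the sequence is pasted as a plain string and that TAA, TAG and TGA are the stop codons, as in translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def summarize_cds(raw: str) -> dict:
    """Strip whitespace from a block-formatted sequence and report basic stats."""
    seq = "".join(raw.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return {
        "length_bp": len(seq),
        "gc_fraction": gc / len(seq),
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        "whole_codons": len(seq) % 3 == 0,
    }


if __name__ == "__main__":
    # Toy input; replace with the full gene sequence from this record.
    print(summarize_cds("ATGTTC AAATGA"))
```

Running the helper on the full sequence should report a length matching the Gene Length field; the GC fraction can be compared with the rounded GC content value.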
 
Protein sequence
MFKRLLPLLL GGACATLQAD PQPSAARLHE LAAEPFWLAL GHYQRDRLGG WRSHVDDGGF 
FLAERGERDP AAELAATLVA LYRDPALADR HPQCRFPART RWLRERLALD DLPQPQCREY
TDWYADIAPH STVLVFPAAY LNSPSSMFGH TLLRIDPPGI DSQGSPLLSY ALNFGAYIEG
SDNSILYAWK GLMGGYPGLF ALLPYREKLA EYSRLENRDL WEYRLNLSAE ETGRMVEHVW
ELRQIQFDYY FFDENCSYRL LELLEIARPG LELTDRFPLT AIPADTVRAV EQAGLIERSA
YRPSREKELL ARADPLSRPE RDWTRRLAGD STALDAAAFQ ALPAERRALV QDAAYRLIRY
RKNGEERDDT SARDSYRLLQ AINRAPPPRL AVERPVPPEN GHQSRTWQLA LGSREDRSFA
EYGLRMAYHD LADNLDGFPL GAQIEIAQLK LRQYEGDHWQ LQRLDVANIR SLTPRSTLLS
PLSWQVGGGL ERVVGKGRHD DEQLVGHLNG GVGATWHLAD DLLGFALATA RLERNGDFAA
FLSPAAGFDA GLLWRNRLGN LGLEARADYF HNGEVRRELS LVQQWEIAGN LGLRLAARRE
FSQQGAPASE LSLQLRWYHY