Gene Avin_31670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31670 
SymboloprE 
ID7762067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3275795 
End bp3277132 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content66% 
IMG OID643806041 
Productouter membrane porin OprE 
Protein accessionYP_002800305 
Protein GI226945232 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGT CTTACCTGGC CCTGGCGGTC GTCCTCGGCG GGGCTGCGCA GCAAGCGGGC 
GCTGCCGGGT TCTTCGAGGA CAGCAAGGCC ACCCTGGGGC TGCGTAACCT CTATTTCAAC
CAGGACAACC GCGACGGCGC CGCCGATCCG TCCAAGCAGG AGGAATGGGG CCAGGGCTTC
ATCCTCGACT ACAAGTCCGG CTACACCCAG GGGCCGGTCG GTTTCGGCGT CGATGCCCTC
GGCCTGTTGG GCATCCGCCT CGACTCCGGC AAGGGCACCC ACTACAACCC GACCAGCGCC
AACAACAGCG GCCAGTTGTT CCCCACCGAA AGCAACGGCC GCGCCGTGCA CGAATACGGC
AGCGTGGGGC TGACCGCCAA GGTGCGTTTC TCCAAGACCG AGGCGCGCCT GGGCACCTTC
CTGCCCAAGC TGCCGGTGGT GACCCACAAC GACGGCCGCC TGCTGCCGCA GACCTTCGAG
GGCGCGCAGT TGAGCAGCAA CGAGATCGAC AACCTGACGC TGATCGGCGG CAAGTTCGAG
CGCGCCAAGG GCCGCAGCTC GACCGACAGC GGCCCGCTGT CGATCGCCGG CGCCAACAAC
GCGCAGACCG GCAAGTTCAG CAACCAGTTC TACTTCGCCG GCGGCGACTA CAAGGTCGGC
AAGAACCTCC TGCTGCAGTA CTACTACGGC AACCTGGAGG ACTTCTACGT GCAGCACTTC
CTCGGCCTGC AGCACGACTG GAAACTGCCG GTGGGTCTCC TGAAGACCGA CCTGCGCTAC
TTCAACAGCG ACTCCGACGG CAAGAACGCC AGCGTCTCCG GGCGTGCCGA AGGGTATCGC
AGCAGCGGCT ACTGGTCGGC CGGCGATTCG GAGCGGGGCG AAGTCGACAA CCGCGCCTGG
AGCGCCAAGT TCACCTACCT GCTGGACGCC CACGAGCTGA GCTTCGGCGT GCAGCGGCTG
TCCGGCAACA GCGACTTCCC GGTGCTCAAC CAGGGCGACG GCTACACCGC CTATCTGATC
ACCGACAGCC AGATCAACAA GTTCCTGCGC GCCGGCGAGC GTACCTGGCG GGCCAGCTAT
GCCTACGATT TCGCCAAGCT CGGCGTGCCG GGCCTGAAGG CCTCGGCGGT CTACCTGTAC
GGCGACAACA TCGACACCGA CGGCAGCGAC GCCAGCGAGT GGGAACGCAA CCTGCGCCTG
GACTACGTCC TGCAGGGCGG CCTGTTCAAG GGCGTCGGTT TCTCCTACCG GCACGCCACG
CTGCGCAGCG ACGTGGCCTC GCAGCGCAGC ATCGACGAGA ACCGCCTGTA CATCACCTAC
AGCCTGCCGC TGCTCTGA
 
Protein sequence
MNKSYLALAV VLGGAAQQAG AAGFFEDSKA TLGLRNLYFN QDNRDGAADP SKQEEWGQGF 
ILDYKSGYTQ GPVGFGVDAL GLLGIRLDSG KGTHYNPTSA NNSGQLFPTE SNGRAVHEYG
SVGLTAKVRF SKTEARLGTF LPKLPVVTHN DGRLLPQTFE GAQLSSNEID NLTLIGGKFE
RAKGRSSTDS GPLSIAGANN AQTGKFSNQF YFAGGDYKVG KNLLLQYYYG NLEDFYVQHF
LGLQHDWKLP VGLLKTDLRY FNSDSDGKNA SVSGRAEGYR SSGYWSAGDS ERGEVDNRAW
SAKFTYLLDA HELSFGVQRL SGNSDFPVLN QGDGYTAYLI TDSQINKFLR AGERTWRASY
AYDFAKLGVP GLKASAVYLY GDNIDTDGSD ASEWERNLRL DYVLQGGLFK GVGFSYRHAT
LRSDVASQRS IDENRLYITY SLPLL