Gene Avin_34040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_34040 
Symbol 
ID7762299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3477268 
End bp3478563 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content64% 
IMG OID643806265 
ProductOprD family outer membrane porin 
Protein accessionYP_002800529 
Protein GI226945456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGACAT ATACACCGTG CGGCTGGCGC GCGCTGGCCC TGACTCTGTC AACGGCATCC 
ATCCAGCCGA CCATGGCCGC TCCGCAGGCC GAGGCGCGGG GATTCGCCGA AGAGGCCGGT
GCCCACCTGA TGCTGCGCAA TGGATACGTG TACCGCGACA ACCAGGACGG CGTGCGCGAC
CAGTCGCGCT GGGCCCAGGC GTTCATCGCC AGTTTCGACT CCGGTTTCAG CCGCGGTCCC
GTCGGCGTCG GCCTGGATGC CTTCGGCCTC CTCGCCGTCC GGCTCGACAC CGGCAAGAGT
CGCGACGACG GCGATATCGC CTACTTTCCC ACCGACAGCG ACGGCGACAC CGAAGAGGAT
CTGGGCGAAC TCGGGATCGC CTTCAAACTG CGGATTTCCA ATACGCTGCT GCAATACGGC
GATCAGCTCC CCGCCCTGCC GGTGCTGTCC TACGACAGCT CCCGCCTGCT GCCGCAGACC
TTCCGCGGCG TGCTGCTCAG CAGCGAGGAG ATCGACGGCC TGACCCTGCA CGCCGGGCGC
TTCACGGCGC AGAACGACAA CAACCACAGC GGCAGGGACG TTCCGGGACG GGAACTGGAT
TCGATCGAAC TCGTCGGCGC CAGCTATGCC TTCAACGAGC GGCTGAGCGT TGCCCTGTAT
TTTTCCGATA TCGAGAAGGT GGCCAGAAAG CGTTACGCCA ATATCGTCTG GCAATTGCCG
CTCGCCGAAG AAAACACCCT GGAGTTCGAC TTCGATTTCT ACCGGACCCG CTACGACCGG
GACTACACCC GGACCGGCAA GGGCGAGGAC AACAGCATCT GGAGCCTGAT GGGCACCTAT
CGCCGGGGGC CTCATGCCTT CATCCTCGCC TGGCAGCGCT CGATCGGCGG CTTCGAGAAT
CTCGACGAGG ACGGCCAGCC GGTCACCCAC GGCTACGACT TCGATTTCGG CGACGGCGGC
GACGCCAATT ACCTGGCCAA CGCCTTCTAT TCCGACTTCA ACCGCAAGGA CGAACGCTCC
TGGCAGATCG GCTACGAACT GGACTTCGCC GATCTGGGCA TGCCGGGCCT GACCTGGAGA
ACCGCCTATG TCCATGGCAG CAAGATCGAC ACCGGCCGGG ACGGCACGGC GAGCGAGCGG
GAGTTCTACA ACCAGATCCA GTACGTGGTG CCGGAAGGCG TGGCCAAGGA CCTGTCGATC
AGCCTGTACG GCTCGATCTA TCGGGCCAGC CGCGACCTGA ACCAGGACCT GAACGAGATC
TGGCTGTTCG TCGACTACCC GCTCGACTAT CCCTGA
 
Protein sequence
MSTYTPCGWR ALALTLSTAS IQPTMAAPQA EARGFAEEAG AHLMLRNGYV YRDNQDGVRD 
QSRWAQAFIA SFDSGFSRGP VGVGLDAFGL LAVRLDTGKS RDDGDIAYFP TDSDGDTEED
LGELGIAFKL RISNTLLQYG DQLPALPVLS YDSSRLLPQT FRGVLLSSEE IDGLTLHAGR
FTAQNDNNHS GRDVPGRELD SIELVGASYA FNERLSVALY FSDIEKVARK RYANIVWQLP
LAEENTLEFD FDFYRTRYDR DYTRTGKGED NSIWSLMGTY RRGPHAFILA WQRSIGGFEN
LDEDGQPVTH GYDFDFGDGG DANYLANAFY SDFNRKDERS WQIGYELDFA DLGMPGLTWR
TAYVHGSKID TGRDGTASER EFYNQIQYVV PEGVAKDLSI SLYGSIYRAS RDLNQDLNEI
WLFVDYPLDY P