Gene Avin_32900 details

Gene Information

Locus tag: Avin_32900
Symbol:
ID: 7762188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3367795
End bp: 3369843
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 69%
IMG OID: 643806158
Product: Oligopeptidase B protein
Protein accession: YP_002800422
Protein GI: 226945349
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1770] Protease II
TIGRFAM ID:
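
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 3367795
END_BP = 3369843
GENE_LENGTH_BP = 2049
PROTEIN_LENGTH_AA = 682


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the stop codon encodes no residue


print(span_length(START_BP, END_BP))            # 2049
print(expected_protein_length(GENE_LENGTH_BP))  # 682
```

Under those assumptions both derived values agree with the Gene Length and Protein Length fields.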


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCAAC CCCCCATCGC CCGAATCGAG AGCCATTGTG CCGATCCCTA CCGCTGGCTG 
GAGCAGCGCG ACGACCCACA GGTGCTCGCC TATCTGGAAG CGGAGAACGC CTACCTGGAG
GCCGAGCTCG CGGATGTCGG CGCGCTGCGC GAGAGCCTGT TCCAGGAGGT CAAGGGCCGC
ATCCGCGAAA CCGACCTGTC CCTGCCGGTA CCCTGGGGAC CCTGGCTCTA CTACCAGCGC
ACCACAGCCG GCGACGAATA CCCGCGCCAC TACCGCTGCC CGCGCCCGGC GGACGGCTCG
CTACGGACCG ACACCAGCCG CGAGGAACTG CTGCTCGACC CAAATGCCCT GGCCGACGGC
GGCTACCTGT CGCTCGGCGC CTTCGAGATC AGCCCCGACC ACCGTTGCCT GGCCTACAGC
CTGGATAGCA GCGGCGACGA GATCTACCGG CTGTTCGTCA GGGAGCTGGA CAGCGGCGTG
CTGCACGAAC TGCCCTTCGA GGACTGCGAC GGCAACATGA CCTGGGCCAA CGACAGCCGG
ACGCTGTTCT TCGCCGAACT GGACGACACC CAGCGCCCGC ACAGGCTGTA CCGCTACCGG
CTCGGCGACG ACGAGACCGA GCTGGTGTTC AACGAGCCGG ACGGACGCTT CTTCATTCAT
TGCTACCGCG CCAGCTCCGA GCGCCAGTTG CTCCTCCTGA GCAGCAGCAA GACCACCTGC
GAGGCCTGGA CGCTGGATGC CGAACGCCCG CAGGAAGCCT TCGTCTGCCT GGCGCCGCGC
CAGGAGGGCC ACGAGTACTA CCCCGACCAC GGGCGCATCG ACGGCGACTG GGGCTGGCTG
ATCCGCAGCA ACCAGGACGG CATCGAATTC GCCGTCTACC AGGCCCCCGA GGGTGCGCCG
GGGCGCGAGC ACTGGCGGCC GCGGATCGCT CACGACGCGG CGCGGATGAT CGAGGACCTC
AGCCTGAACG CCGCGGGCTT CGTCCTCAGC CTGCGCGAGA AGGGTCTGCC GATCGTCGAG
GTGCACCCGG CCGAGGCCAC GCCCTACCGC GTTGAACTGC CGGACGCCGC CTACAGCCTG
GACGTGCAGG ACATCCTGGA ATTCGACAGC CCGGTGATCC GCCTGCGCTA CGAGGCGCTC
AACCGCCCGG CGCAGATCCG CCAGTTGGAC CTGGCCAGCG GCGCCCAGCG AGTGCTCAAG
GAAACCCCGG TGGAAGGACC GTTCGACGCC GACGACTATC TCAGCCTGCG GCTCTGGGCC
GAGGCGGCCG ACGGCGCGCG CATCCCGGTC AGCCTGGTCG CCCGCCGCGA CATCCTCCGA
GGCGAGGGGC AAAAGCGCCC CGCTCCGCTC TATCTCTATG GCTACGGCGC CTATGGCGAG
AGCCTCGACC CCTGGTTCTC CCACGCCCGG CTGAGCCTCC TGGAGCGCGG CTTCGTCTTC
GCCATCGCCC ATGTGCGCGG CGGCGGCGAA CTGGGCGAGG CCTGGTACCG CGCCGGCAAG
CTGGAACACA AGGAAAACAC CTTCGGCGAC TTCATCGCCG TGGCCGAGCA CCTGATCGCC
GAGGGCGTCA CCTGCGCCGA CCGGCTGGCG ATCAGCGGCG GCAGCGCCGG CGGCCTGCTG
ATCGGCGCCG TGCTCAACCG GCGTCCGGAG CTGTTCGCCG CGGCGATCGC CGAGGTGCCC
TTCGTCGACG TGCTGAACAC CATGCACAAC CCCGAGCTGC CGCTGACCGT CACCGAGTAC
GACGAGTGGG GCGATCCGCG CGATCCCGAG GTCCATGCCC GGATCGCCGC CTACGCCCCC
TACGAGAACG TGCGCGCCCA GGCCTACCCG GCGATCCTCG CGGTGGCCAG CTACCACGAC
AGCCGGGTGC AGTACTGGGA GGCGGCGAAG TGGGTGGCCA GGCTGCGCGC CAGCAAGACC
GACGCCAACC TGCTGCTGCT GAAGACCGAG TTCGGCGCCG GCCACGGCGG CATGAGCGGA
CGCTATCAGG CACTCAGGGA CGTGGCGCTG GAATACGCCT TCCTGCTCAG GGTGCTCGGC
CGGGTCTGA
 
Protein sequence
MPQPPIARIE SHCADPYRWL EQRDDPQVLA YLEAENAYLE AELADVGALR ESLFQEVKGR 
IRETDLSLPV PWGPWLYYQR TTAGDEYPRH YRCPRPADGS LRTDTSREEL LLDPNALADG
GYLSLGAFEI SPDHRCLAYS LDSSGDEIYR LFVRELDSGV LHELPFEDCD GNMTWANDSR
TLFFAELDDT QRPHRLYRYR LGDDETELVF NEPDGRFFIH CYRASSERQL LLLSSSKTTC
EAWTLDAERP QEAFVCLAPR QEGHEYYPDH GRIDGDWGWL IRSNQDGIEF AVYQAPEGAP
GREHWRPRIA HDAARMIEDL SLNAAGFVLS LREKGLPIVE VHPAEATPYR VELPDAAYSL
DVQDILEFDS PVIRLRYEAL NRPAQIRQLD LASGAQRVLK ETPVEGPFDA DDYLSLRLWA
EAADGARIPV SLVARRDILR GEGQKRPAPL YLYGYGAYGE SLDPWFSHAR LSLLERGFVF
AIAHVRGGGE LGEAWYRAGK LEHKENTFGD FIAVAEHLIA EGVTCADRLA ISGGSAGGLL
IGAVLNRRPE LFAAAIAEVP FVDVLNTMHN PELPLTVTEY DEWGDPRDPE VHARIAAYAP
YENVRAQAYP AILAVASYHD SRVQYWEAAK WVARLRASKT DANLLLLKTE FGAGHGGMSG
RYQALRDVAL EYAFLLRVLG RV
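
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that translation, assuming the amino acid assignments of translation table 11 match those of the standard code (the two tables differ in which start codons are permitted, which this sketch does not model). The `translate` helper and the constant names are hypothetical; the spaces that split the sequence into blocks of 10 are removed before translating.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acid string
# conventionally used for the standard code (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGCCGCAAC CCCCCATCGC CCGAATCGAG"))  # MPQPPIARIE
print(translate("CGGGTCTGA"))                         # RV*
```

The two outputs match the first ten residues and the final two residues of the protein sequence, with the trailing `*` standing for the TGA stop codon.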