Gene Avin_46800 details

Gene Information

Locus tag: Avin_46800
Symbol: lptD
ID: 7763543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4748380
End bp: 4751139
Gene Length: 2760 bp
Protein Length: 919 aa
Translation table: 11
GC content: 66%
IMG OID: 643807524
Product: Organic solvent tolerance protein
Protein accession: YP_002801760
Protein GI: 226946687
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
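
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the stated gene length) and that the final codon of the CDS is a stop codon that contributes no residue. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4748380, 4751139)
print(length)                     # 2760
print(protein_length_aa(length))  # 919
```

Both results agree with the "Gene Length" and "Protein Length" fields of this record.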


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.532703
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCTCA CTCCTCCCCG CTCGTTGTTC CGCAGAAAGT TTCCTCTTCT GGTCACCGGT 
AGCCTGTTGG TTCTGGGCTC GGCTCGCCTG CTGGCCGCCG AGCAGTTCGA CTGCCGGCCC
TCCGCCGCGG GTGGCTGGGA CTGCGCCCCC AAGGCCAGCG CCCAACCCCT GCCGCCGCGC
CCGCACGAAG CCGCCAGCGC GGCGGCCGCT CCCGGCGCGC AGCCCGCCGG CAAGGAGAAG
GCCGCGCCGA CCCTGGTCAC CGAGAGCGAA GGCTTCGCCC TGGCTTCGCG CAGCGCCGAC
TACAGCCACC TGGACTGGGT ACCGAGGGAG AAGCTGAGCC CGGCGCAACT GGCCGAAATC
AGTCCCTACT GCACCGGCAC CTACGTCGAG CCGCCACGAG TGGGCATGGA CGACAGCACG
CCGATGCGCG ACGCGCCGAC CTACGTCTCC GCCCGCGCGT CGCGCTACGA CCAGGAGAAG
CAGATCGCCT CGATCGCCGG CGACGTGGTG CTGCGCCAGG GCAGCATGCA GGTGGAGGCC
GACGAGGCGC GCCTGCACCA GCAGGAAAGC CGGGGCGAGG TGCTCGGCAA CGTCAGGCTG
CGCGACAAGG GCTTTTTGGT CGTCGGCGAC CGCGCCGAGC TGCTGATGGA AAACGGCGAG
GCGCAAGTGG AGAACGCCGA GTACGTCGTC CACAGCGCCC ACGCTCGCGG CAGCGCGCTC
AAGGCCAAGC GCGAGGAAAC CTCGATCATC CGCCTCAAGG ACGGTACCTA CACCACCTGC
GAGCCGGGCA GCAACACCTG GACCCTGAGC GGCAAGAACA TCAAGCTGGA CCCGGTCAGC
GGCCGCGGCA CTGCCACCCA CGTGACGCTG CGGGTACACG ACCTGCCGGT GTTCTACACC
CCGTACATCC AGTTCCCGAT CGACAACCGC CGCCAGTCCG GTTTCCTGAC GCCGGGCTTC
AGCAGTTCGG GCAGCAGCGG CCTCTCCCTG CAGGCGCCCT ACTACTTCAA CCTGGCGCCG
AACTACGACG CCACGCTCTA CCCGACCTAC ATGACCGACC GCGGCCTGCT GCTGGAAGGC
GAGTTCCGCT ACCTGACCAA GACCAGCGAA GGCCAGGTCG GCGCGGCCTA CATCGACGAC
AAGGAAGACG AGCGCGAGCT GCAGTCGGAC TACGAGGACC AGCGCTGGAT GTACAGCTGG
CAGCACATCG GCGGGCTGAA CTCGCGCCTG ATGGCCGAGG TCGACTTCAC CGACATCAGC
GACCCCTACT ACTTCCAGGA CCTGCGCACC GACCTGGGCA TCAACCAGCC GGACTTCCTC
AACCAGCGCG GCACCCTGAG CTGGCGCGGC GACACCTTCA CCGCGCGCCT CAACGCGCAC
GCCTTCGAGC GCACCACGAT CGCCGACCGC ACGCCCTACG ACATCCTGCC GCAGGTCAGC
CTCGACGGCT TCCTGCCGTA CCGCCCGTAC GGCCTGTACT TCACCTACGG CACCGAGTAC
GCCAAGTTCG AGCGCGACCT GCGCACCGGC TTCTTCACCG ACGAGGACGG CAACACGGAC
GAACGCTGGT ACGACGAATA CATTACCGGG CTGGATCGCG CCGAAGGCGA CCGCACCCAC
CTGGAGCCGG GCGTCAGCCT GCCGCTGGAG TGGACCTGGG GCTACCTCAA GCCCTCGGTC
AAGATCGCCC ATACCCGCTA CGACCTGGAT CTGGACGCCC AGGGCAAGAC CACACTCGGC
GGCCAGCACT TCGACGACAG CCCGGACCGC ACCGTGCCGC TCTACAGCCT GGAGGGCGGC
CTCTACTTCG ACCGCAACGT CAACTGGTTC GGCAAGGGCT TCCGGCAGAC CCTGGAGCCG
CGCATGTACT ACCTCTACGT GCCGTACCGC GACCAGGAGG ACATACCGGT ATTCGATACC
GGCGAGCACG TCTTCAGCTA CGCCTCGCTG TGGCGCGACA ACCGCTTCAC CGGCAAGGAC
CGCATCGGCG ACGCCAACCA GTTGTCGCTC GGCGTGACCA GCCGCTGGAT CGAAGCCAAC
GGCTTCGAAC GCCAGCGCAT CAGCTTCGGC CAGACCTTCT ACTTCCAGGA TCGCCGGGTG
CAGATGCCGC GCGTGGACTA CGAGAAACGC GACGACTCCG AGTCCAAGGT CTCGCCCTAC
GCGCTGGAAT ATGTGTACCG CTTCAACCAC GACTGGCGCC TGACCTCCGA GTTCAACTGG
GACCCGGACG AGCACCATCC CCGCTCCAGC AACGTCATGT TCCATTACCA GCCGGCCGAC
AATCCGAACA AGATCGTCAA CCTGGGCTAT CGCTACCGCA ACGACGTGAT GCGCTACAAC
CAGTCCACCG GCACCTGGGA CTACAGTACC GACTTCGGCA ACTGCGACAC CGACCCGGAC
TGCATCAAGG ACTACTACAA GATCGACCAG CACGACTTCT CGACCATCTG GCCGCTCGCC
CCGCACTGGA GCGCCATCGC CCGCTGGCAG TACGACTACA GCCGCGATCG CACCCTGGAA
GCCTTCGGCG GCTTCGAGTA CGACAGTTGT TGCTGGAAGC TGCGCCTGAT CAGCCGCTAC
TGGATCGGCT ACGACGAGAA CGAACTCAAC CCGGATCAGA ACGACGACGC CGACAAGGGC
CTCTTCCTGC AGGTGGTGTT CAAGGGCCTC GGCGGCGTGA TGGGCAACAA GGTGGAGGCG
TTCCTCGACC AAGGCATCGA AGGTTACCTC GAACGTGAAC AGCAACAAAA AGCTCACTGA
 
Protein sequence
MALTPPRSLF RRKFPLLVTG SLLVLGSARL LAAEQFDCRP SAAGGWDCAP KASAQPLPPR 
PHEAASAAAA PGAQPAGKEK AAPTLVTESE GFALASRSAD YSHLDWVPRE KLSPAQLAEI
SPYCTGTYVE PPRVGMDDST PMRDAPTYVS ARASRYDQEK QIASIAGDVV LRQGSMQVEA
DEARLHQQES RGEVLGNVRL RDKGFLVVGD RAELLMENGE AQVENAEYVV HSAHARGSAL
KAKREETSII RLKDGTYTTC EPGSNTWTLS GKNIKLDPVS GRGTATHVTL RVHDLPVFYT
PYIQFPIDNR RQSGFLTPGF SSSGSSGLSL QAPYYFNLAP NYDATLYPTY MTDRGLLLEG
EFRYLTKTSE GQVGAAYIDD KEDERELQSD YEDQRWMYSW QHIGGLNSRL MAEVDFTDIS
DPYYFQDLRT DLGINQPDFL NQRGTLSWRG DTFTARLNAH AFERTTIADR TPYDILPQVS
LDGFLPYRPY GLYFTYGTEY AKFERDLRTG FFTDEDGNTD ERWYDEYITG LDRAEGDRTH
LEPGVSLPLE WTWGYLKPSV KIAHTRYDLD LDAQGKTTLG GQHFDDSPDR TVPLYSLEGG
LYFDRNVNWF GKGFRQTLEP RMYYLYVPYR DQEDIPVFDT GEHVFSYASL WRDNRFTGKD
RIGDANQLSL GVTSRWIEAN GFERQRISFG QTFYFQDRRV QMPRVDYEKR DDSESKVSPY
ALEYVYRFNH DWRLTSEFNW DPDEHHPRSS NVMFHYQPAD NPNKIVNLGY RYRNDVMRYN
QSTGTWDYST DFGNCDTDPD CIKDYYKIDQ HDFSTIWPLA PHWSAIARWQ YDYSRDRTLE
AFGGFEYDSC CWKLRLISRY WIGYDENELN PDQNDDADKG LFLQVVFKGL GGVMGNKVEA
FLDQGIEGYL EREQQQKAH
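
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks could be cleaned, translated, and checked for GC content is given below. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted), so a standard codon table is used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and do not come from the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGGCCCTCA CTCCTCCCCG CTCGTTGTTC"))  # MALTPPRSLF
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in this record, and `gc_fraction` applied to the full gene sequence can be compared with the "GC content" field.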