Gene Information |
Locus tag | Avin_46800 |
Symbol | lptD |
ID | 7763543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4748380 |
End bp | 4751139 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643807524 |
Product | Organic solvent tolerance protein |
Protein accession | YP_002801760 |
Protein GI | 226946687 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
|
|
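The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the last codon is a stop codon that adds no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the record above (Avin_46800, lptD).
length = gene_length(4748380, 4751139)
print(length)                           # 2760
print(expected_protein_length(length))  # 919
```

Under these assumptions the computed values agree with the Gene Length (2760 bp) and Protein Length (919 aa) fields.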
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.532703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTCA CTCCTCCCCG CTCGTTGTTC CGCAGAAAGT TTCCTCTTCT GGTCACCGGT AGCCTGTTGG TTCTGGGCTC GGCTCGCCTG CTGGCCGCCG AGCAGTTCGA CTGCCGGCCC TCCGCCGCGG GTGGCTGGGA CTGCGCCCCC AAGGCCAGCG CCCAACCCCT GCCGCCGCGC CCGCACGAAG CCGCCAGCGC GGCGGCCGCT CCCGGCGCGC AGCCCGCCGG CAAGGAGAAG GCCGCGCCGA CCCTGGTCAC CGAGAGCGAA GGCTTCGCCC TGGCTTCGCG CAGCGCCGAC TACAGCCACC TGGACTGGGT ACCGAGGGAG AAGCTGAGCC CGGCGCAACT GGCCGAAATC AGTCCCTACT GCACCGGCAC CTACGTCGAG CCGCCACGAG TGGGCATGGA CGACAGCACG CCGATGCGCG ACGCGCCGAC CTACGTCTCC GCCCGCGCGT CGCGCTACGA CCAGGAGAAG CAGATCGCCT CGATCGCCGG CGACGTGGTG CTGCGCCAGG GCAGCATGCA GGTGGAGGCC GACGAGGCGC GCCTGCACCA GCAGGAAAGC CGGGGCGAGG TGCTCGGCAA CGTCAGGCTG CGCGACAAGG GCTTTTTGGT CGTCGGCGAC CGCGCCGAGC TGCTGATGGA AAACGGCGAG GCGCAAGTGG AGAACGCCGA GTACGTCGTC CACAGCGCCC ACGCTCGCGG CAGCGCGCTC AAGGCCAAGC GCGAGGAAAC CTCGATCATC CGCCTCAAGG ACGGTACCTA CACCACCTGC GAGCCGGGCA GCAACACCTG GACCCTGAGC GGCAAGAACA TCAAGCTGGA CCCGGTCAGC GGCCGCGGCA CTGCCACCCA CGTGACGCTG CGGGTACACG ACCTGCCGGT GTTCTACACC CCGTACATCC AGTTCCCGAT CGACAACCGC CGCCAGTCCG GTTTCCTGAC GCCGGGCTTC AGCAGTTCGG GCAGCAGCGG CCTCTCCCTG CAGGCGCCCT ACTACTTCAA CCTGGCGCCG AACTACGACG CCACGCTCTA CCCGACCTAC ATGACCGACC GCGGCCTGCT GCTGGAAGGC GAGTTCCGCT ACCTGACCAA GACCAGCGAA GGCCAGGTCG GCGCGGCCTA CATCGACGAC AAGGAAGACG AGCGCGAGCT GCAGTCGGAC TACGAGGACC AGCGCTGGAT GTACAGCTGG CAGCACATCG GCGGGCTGAA CTCGCGCCTG ATGGCCGAGG TCGACTTCAC CGACATCAGC GACCCCTACT ACTTCCAGGA CCTGCGCACC GACCTGGGCA TCAACCAGCC GGACTTCCTC AACCAGCGCG GCACCCTGAG CTGGCGCGGC GACACCTTCA CCGCGCGCCT CAACGCGCAC GCCTTCGAGC GCACCACGAT CGCCGACCGC ACGCCCTACG ACATCCTGCC GCAGGTCAGC CTCGACGGCT TCCTGCCGTA CCGCCCGTAC GGCCTGTACT TCACCTACGG CACCGAGTAC GCCAAGTTCG AGCGCGACCT GCGCACCGGC TTCTTCACCG ACGAGGACGG CAACACGGAC GAACGCTGGT ACGACGAATA CATTACCGGG CTGGATCGCG CCGAAGGCGA CCGCACCCAC CTGGAGCCGG GCGTCAGCCT GCCGCTGGAG TGGACCTGGG GCTACCTCAA GCCCTCGGTC AAGATCGCCC ATACCCGCTA CGACCTGGAT CTGGACGCCC AGGGCAAGAC CACACTCGGC GGCCAGCACT TCGACGACAG CCCGGACCGC ACCGTGCCGC TCTACAGCCT GGAGGGCGGC 
CTCTACTTCG ACCGCAACGT CAACTGGTTC GGCAAGGGCT TCCGGCAGAC CCTGGAGCCG CGCATGTACT ACCTCTACGT GCCGTACCGC GACCAGGAGG ACATACCGGT ATTCGATACC GGCGAGCACG TCTTCAGCTA CGCCTCGCTG TGGCGCGACA ACCGCTTCAC CGGCAAGGAC CGCATCGGCG ACGCCAACCA GTTGTCGCTC GGCGTGACCA GCCGCTGGAT CGAAGCCAAC GGCTTCGAAC GCCAGCGCAT CAGCTTCGGC CAGACCTTCT ACTTCCAGGA TCGCCGGGTG CAGATGCCGC GCGTGGACTA CGAGAAACGC GACGACTCCG AGTCCAAGGT CTCGCCCTAC GCGCTGGAAT ATGTGTACCG CTTCAACCAC GACTGGCGCC TGACCTCCGA GTTCAACTGG GACCCGGACG AGCACCATCC CCGCTCCAGC AACGTCATGT TCCATTACCA GCCGGCCGAC AATCCGAACA AGATCGTCAA CCTGGGCTAT CGCTACCGCA ACGACGTGAT GCGCTACAAC CAGTCCACCG GCACCTGGGA CTACAGTACC GACTTCGGCA ACTGCGACAC CGACCCGGAC TGCATCAAGG ACTACTACAA GATCGACCAG CACGACTTCT CGACCATCTG GCCGCTCGCC CCGCACTGGA GCGCCATCGC CCGCTGGCAG TACGACTACA GCCGCGATCG CACCCTGGAA GCCTTCGGCG GCTTCGAGTA CGACAGTTGT TGCTGGAAGC TGCGCCTGAT CAGCCGCTAC TGGATCGGCT ACGACGAGAA CGAACTCAAC CCGGATCAGA ACGACGACGC CGACAAGGGC CTCTTCCTGC AGGTGGTGTT CAAGGGCCTC GGCGGCGTGA TGGGCAACAA GGTGGAGGCG TTCCTCGACC AAGGCATCGA AGGTTACCTC GAACGTGAAC AGCAACAAAA AGCTCACTGA
|
Protein sequence | MALTPPRSLF RRKFPLLVTG SLLVLGSARL LAAEQFDCRP SAAGGWDCAP KASAQPLPPR PHEAASAAAA PGAQPAGKEK AAPTLVTESE GFALASRSAD YSHLDWVPRE KLSPAQLAEI SPYCTGTYVE PPRVGMDDST PMRDAPTYVS ARASRYDQEK QIASIAGDVV LRQGSMQVEA DEARLHQQES RGEVLGNVRL RDKGFLVVGD RAELLMENGE AQVENAEYVV HSAHARGSAL KAKREETSII RLKDGTYTTC EPGSNTWTLS GKNIKLDPVS GRGTATHVTL RVHDLPVFYT PYIQFPIDNR RQSGFLTPGF SSSGSSGLSL QAPYYFNLAP NYDATLYPTY MTDRGLLLEG EFRYLTKTSE GQVGAAYIDD KEDERELQSD YEDQRWMYSW QHIGGLNSRL MAEVDFTDIS DPYYFQDLRT DLGINQPDFL NQRGTLSWRG DTFTARLNAH AFERTTIADR TPYDILPQVS LDGFLPYRPY GLYFTYGTEY AKFERDLRTG FFTDEDGNTD ERWYDEYITG LDRAEGDRTH LEPGVSLPLE WTWGYLKPSV KIAHTRYDLD LDAQGKTTLG GQHFDDSPDR TVPLYSLEGG LYFDRNVNWF GKGFRQTLEP RMYYLYVPYR DQEDIPVFDT GEHVFSYASL WRDNRFTGKD RIGDANQLSL GVTSRWIEAN GFERQRISFG QTFYFQDRRV QMPRVDYEKR DDSESKVSPY ALEYVYRFNH DWRLTSEFNW DPDEHHPRSS NVMFHYQPAD NPNKIVNLGY RYRNDVMRYN QSTGTWDYST DFGNCDTDPD CIKDYYKIDQ HDFSTIWPLA PHWSAIARWQ YDYSRDRTLE AFGGFEYDSC CWKLRLISRY WIGYDENELN PDQNDDADKG LFLQVVFKGL GGVMGNKVEA FLDQGIEGYL EREQQQKAH
|
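The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it strips the spaces used to group the sequence into blocks of ten, computes the GC fraction, and translates codon by codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = "ATGGCCCTCA CTCCTCCCCG CTCGTTGTTC"
print(translate(prefix))  # MALTPPRSLF
```

Passing the full Gene sequence field to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content fields listed in the record.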
| |