Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_3216 |
Symbol | |
ID | 6976656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3520057 |
End bp | 3523047 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643392729 |
Product | virulence factor SrfB-like protein |
Protein accession | YP_002277561 |
Protein GI | 209545332 |
COG category | [S] Function unknown |
COG ID | [COG4457] Uncharacterized protein conserved in bacteria, putative virulence factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.952482 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGGCT TGATCTCGTT GATTCCCGAC AGCCTGATCC AATTCGAGAC CATTACGCTG GACGCCGCCG AATGCGGCAG AATCCGGGCG AATTTCTGGG AGGAGTCCAC CGGGATCAAC GGGGACGGGG CGCCGGAATT CATCCTGCGC GCGCTTGAGG AAGACCCCGA AAGCAGGAAC ATGGTGTTCC ACGCCGATGG CCGCCAGCGG GTGGCGCCGG ACGATGCAGG GTATGTCATC AGGCGCAAGG ACATTTTCGA GGTTTTCGAG AATGAGTGGG TGCCGCTGCC GTATCTTGCG CGGATCGGAA GCACCGACAA TGAGGGATTT CGCCCCGGTC CGGCAAACTG GGTCAGGGGC CGGCTGCGCC GCACGGAAAC CCGGGAAGGG GAGGAGGCGA TTCTCCTCAA CATCGCCTTC GACACCACGG TCGCGCCGCA TTTCGACGGG CGCTACTTGG CGCCGACCCC CGATGACGAG GCGCGGGGCC AGAAATTCGC GTTCGTCGCC GACCTGGAAA GCATTTCCTG GCTGATCGAC AGTTCCTGGC TGAAGGACTG GCTGGAGGAA TTGCAGCGCG AGGCCGCGCT TGCCGTCCCC GGCCGGAAGC GCCCGGCGCC CGAGGAGGAG GAAAAGCAGC CGGTCCTGAA GCATCTTGCG ACCTATTTGG CCTTCCTGCG GATGGTGGCG CAGGCCGGAG CGCCGCCGAT CGCGCAGATC CTGACGGTGA CGAACGAACC GCCGGTCGCG GTCGATCTGG TGCTGGATCT GGGGAATTTT CGCAGTTGCG GTATCCTGAT CGAGGAACAT CCGGGGACGC GCCAGCGCGA AGCCGACAGC TACGTCCTGG AACTGCGCGA CCTCGCCCAG CCTGAACTGG CCTATGGCGA CCCGTTTTCC AGCCGGATCG AGTTCGGACG GGCGACGTTC GGGCGGGATT CCCATTCGCG CCGCTCCGGC CGGCCCCATG CCTTCGTGTG GCAGAGCCCC GTGCGGACCG GGCCCGAGGC CGAACGGATG ATGGGGGCAC GGATCGGTAA CGAGGGGCTG TCGGGCCTGT CTGCTCCCAA ACGCTATCTG TGGGACACCC GGCCGTCCGA CCAGGGATGG CGCTTCAATA ATGGCCTTAA CCGAGACGGC CAGCCGTCCG ATCCGGCGGT CAACGGCCCG TTCCGCCGTG CGATCCAGGA AACGCTGGGC GCCCAGCGGC GGTTCAGGGT CAGCCTGGAC GACGTGAAGG CGGCGCAGCC CGTGGGGCCG AGCGTGCTGT CGCGCAGCCT GCTGTTCATG CTGATGTTGA GCGAACTGAT CATGCAGGCG ATCATGTATA TGAATTCGCC GGGCTTTCGG TCCAAGCGGC GGGACAGCGC GCGTCCGCGC CTGCTGCGTT CGGTGATGCT GACAATGCCG CCTGGAATGC CCGTGGCGGA ACAGCGGATT TTCCGTGCCC GGGCCGAGGC GGCGATCGCG TTCGTCTGGA GCGTGACGGG GCGCACGGGC AAGCCGCCAA CGCTGCGCGC CGAACTGGAC GAGGCCACGG CGACGCAGAT CGTCTGGCTG CATAACGAGG TGACGGAGCG GCTCGGCGGC CATGTCGAGG CGCTGTTCGA TCTATATGGC GACGGCCGCG TGGGCGAGGA CGGGCGGCCG GCAGTCCGTG TCGCGAGCAT CGACATCGGG GGCGGCACCA CCGACCTGAT GATCACGACC TATACCCATA TAGGGGGCGA CGCGCTGTCC CCGCACCAGG ATTTCCGCGA GAGCTTCAAG ATCGCCGGGG ATGACCTGCT TGAACGTGTC ATCCTGACCG TGGTGCTGCC CCCGTTGGAA AAGGCGCTGA CCGACGCGGG CATCGCCGAG GCGCACCAGC TTCTGGTCCG TCTGCTGGAA CGGGATTACG GCAACCAGGA CGAGCGCGAC CGTCACCGCC GCAGGCTGTT CGTTTCGACC GTTCTCGAAC CCACCGCGCT GCGGGCGCTG AGCCTCTACG AACAGGTGGA CGACCTGACG CAGGGGGGAA TCGGGCGATT CGCGCTGGGT GACGACACGG TGGCCCGTAG GGATGACGAC CATCGGAAAC GCTGTGCCGC TTTCCTGCGC GAGCAGGTCG ATACCCATAT GCGTGGCCTG AACATCGGCG ACGGCCCGCC CGATTTTGAC CTGTTCGGCG TGGTGGTCGA GGTCGAGGTG GAAACGATCG AGAAGGCGAT CCGCGCGACG CTGGATACGG TTCTGGGCAG CCTCACCGAA CTGGTCTGGG CCTATGGATG CGATGTCCTG CTGTTGTCGG GGCGCCCGTC GAAGCTGCGC CGGGTGAGGA ACCTGGTCAC GGGCGCGATG CCCGTCGCCC CGCACCGGAT CGTCGCGATG CACGATTACA CGGTGGGCCG GCACTATCCC TTCACGGATT CCGCGGGCCG GATCAGCGAC CCCAAGACGA CCGTCGTCGT CGGTGCGGCG CTGTGCATCA AGGCCGAGGG AAAGCTCCAG AATTTTGTCC TGAGAACGGG GAAGCTGGTC ATGCGCTCGA CAGCGCGGTT CCTAGGCAGG ATGGAGCGGA CCGGAATGAT CCGCAAAGCC GATATTCTGC TGGAGAACGT CGATCTGGAC GATCGTCCCA GCGACGAGGA TATCCGCTTC CCGCTGGGGG ATTTCGAAGG AATGACCATG ATCGGTTTCC GCCAGCTCCC GCTGGAGCGG TGGACGACGA CGCCTCTTTA TTGCGCGGAG TTCGCTGCTG GCACGCGCGA GGAAACGCAG CGCCTGGCCA TGCCGCTGAC CATCGAATTC GACCGGAAAC CCGATATCGA GGACGCCGAG GAAAAGGGTG ATTTCTCGGG ACGCGAGGAT TTTAGCGTCA GCGAGATTAC CGATGCCGAC GGGCAGCCTG TTCGTCGCAG CCTGATCATC CTGCGCTTGC AGACCACGCT GAACCCCGAA GGCTACTGGC GCGATACCGG GTGCCTGACG CTGGGAGCGA CCCTGCCGTG A
|
Protein sequence | MPGLISLIPD SLIQFETITL DAAECGRIRA NFWEESTGIN GDGAPEFILR ALEEDPESRN MVFHADGRQR VAPDDAGYVI RRKDIFEVFE NEWVPLPYLA RIGSTDNEGF RPGPANWVRG RLRRTETREG EEAILLNIAF DTTVAPHFDG RYLAPTPDDE ARGQKFAFVA DLESISWLID SSWLKDWLEE LQREAALAVP GRKRPAPEEE EKQPVLKHLA TYLAFLRMVA QAGAPPIAQI LTVTNEPPVA VDLVLDLGNF RSCGILIEEH PGTRQREADS YVLELRDLAQ PELAYGDPFS SRIEFGRATF GRDSHSRRSG RPHAFVWQSP VRTGPEAERM MGARIGNEGL SGLSAPKRYL WDTRPSDQGW RFNNGLNRDG QPSDPAVNGP FRRAIQETLG AQRRFRVSLD DVKAAQPVGP SVLSRSLLFM LMLSELIMQA IMYMNSPGFR SKRRDSARPR LLRSVMLTMP PGMPVAEQRI FRARAEAAIA FVWSVTGRTG KPPTLRAELD EATATQIVWL HNEVTERLGG HVEALFDLYG DGRVGEDGRP AVRVASIDIG GGTTDLMITT YTHIGGDALS PHQDFRESFK IAGDDLLERV ILTVVLPPLE KALTDAGIAE AHQLLVRLLE RDYGNQDERD RHRRRLFVST VLEPTALRAL SLYEQVDDLT QGGIGRFALG DDTVARRDDD HRKRCAAFLR EQVDTHMRGL NIGDGPPDFD LFGVVVEVEV ETIEKAIRAT LDTVLGSLTE LVWAYGCDVL LLSGRPSKLR RVRNLVTGAM PVAPHRIVAM HDYTVGRHYP FTDSAGRISD PKTTVVVGAA LCIKAEGKLQ NFVLRTGKLV MRSTARFLGR MERTGMIRKA DILLENVDLD DRPSDEDIRF PLGDFEGMTM IGFRQLPLER WTTTPLYCAE FAAGTREETQ RLAMPLTIEF DRKPDIEDAE EKGDFSGRED FSVSEITDAD GQPVRRSLII LRLQTTLNPE GYWRDTGCLT LGATLP
|
| |