Gene Gdia_3216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3216 
Symbol 
ID6976656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3520057 
End bp3523047 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content65% 
IMG OID643392729 
Productvirulence factor SrfB-like protein 
Protein accessionYP_002277561 
Protein GI209545332 
COG category[S] Function unknown 
COG ID[COG4457] Uncharacterized protein conserved in bacteria, putative virulence factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.952482 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGCT TGATCTCGTT GATTCCCGAC AGCCTGATCC AATTCGAGAC CATTACGCTG 
GACGCCGCCG AATGCGGCAG AATCCGGGCG AATTTCTGGG AGGAGTCCAC CGGGATCAAC
GGGGACGGGG CGCCGGAATT CATCCTGCGC GCGCTTGAGG AAGACCCCGA AAGCAGGAAC
ATGGTGTTCC ACGCCGATGG CCGCCAGCGG GTGGCGCCGG ACGATGCAGG GTATGTCATC
AGGCGCAAGG ACATTTTCGA GGTTTTCGAG AATGAGTGGG TGCCGCTGCC GTATCTTGCG
CGGATCGGAA GCACCGACAA TGAGGGATTT CGCCCCGGTC CGGCAAACTG GGTCAGGGGC
CGGCTGCGCC GCACGGAAAC CCGGGAAGGG GAGGAGGCGA TTCTCCTCAA CATCGCCTTC
GACACCACGG TCGCGCCGCA TTTCGACGGG CGCTACTTGG CGCCGACCCC CGATGACGAG
GCGCGGGGCC AGAAATTCGC GTTCGTCGCC GACCTGGAAA GCATTTCCTG GCTGATCGAC
AGTTCCTGGC TGAAGGACTG GCTGGAGGAA TTGCAGCGCG AGGCCGCGCT TGCCGTCCCC
GGCCGGAAGC GCCCGGCGCC CGAGGAGGAG GAAAAGCAGC CGGTCCTGAA GCATCTTGCG
ACCTATTTGG CCTTCCTGCG GATGGTGGCG CAGGCCGGAG CGCCGCCGAT CGCGCAGATC
CTGACGGTGA CGAACGAACC GCCGGTCGCG GTCGATCTGG TGCTGGATCT GGGGAATTTT
CGCAGTTGCG GTATCCTGAT CGAGGAACAT CCGGGGACGC GCCAGCGCGA AGCCGACAGC
TACGTCCTGG AACTGCGCGA CCTCGCCCAG CCTGAACTGG CCTATGGCGA CCCGTTTTCC
AGCCGGATCG AGTTCGGACG GGCGACGTTC GGGCGGGATT CCCATTCGCG CCGCTCCGGC
CGGCCCCATG CCTTCGTGTG GCAGAGCCCC GTGCGGACCG GGCCCGAGGC CGAACGGATG
ATGGGGGCAC GGATCGGTAA CGAGGGGCTG TCGGGCCTGT CTGCTCCCAA ACGCTATCTG
TGGGACACCC GGCCGTCCGA CCAGGGATGG CGCTTCAATA ATGGCCTTAA CCGAGACGGC
CAGCCGTCCG ATCCGGCGGT CAACGGCCCG TTCCGCCGTG CGATCCAGGA AACGCTGGGC
GCCCAGCGGC GGTTCAGGGT CAGCCTGGAC GACGTGAAGG CGGCGCAGCC CGTGGGGCCG
AGCGTGCTGT CGCGCAGCCT GCTGTTCATG CTGATGTTGA GCGAACTGAT CATGCAGGCG
ATCATGTATA TGAATTCGCC GGGCTTTCGG TCCAAGCGGC GGGACAGCGC GCGTCCGCGC
CTGCTGCGTT CGGTGATGCT GACAATGCCG CCTGGAATGC CCGTGGCGGA ACAGCGGATT
TTCCGTGCCC GGGCCGAGGC GGCGATCGCG TTCGTCTGGA GCGTGACGGG GCGCACGGGC
AAGCCGCCAA CGCTGCGCGC CGAACTGGAC GAGGCCACGG CGACGCAGAT CGTCTGGCTG
CATAACGAGG TGACGGAGCG GCTCGGCGGC CATGTCGAGG CGCTGTTCGA TCTATATGGC
GACGGCCGCG TGGGCGAGGA CGGGCGGCCG GCAGTCCGTG TCGCGAGCAT CGACATCGGG
GGCGGCACCA CCGACCTGAT GATCACGACC TATACCCATA TAGGGGGCGA CGCGCTGTCC
CCGCACCAGG ATTTCCGCGA GAGCTTCAAG ATCGCCGGGG ATGACCTGCT TGAACGTGTC
ATCCTGACCG TGGTGCTGCC CCCGTTGGAA AAGGCGCTGA CCGACGCGGG CATCGCCGAG
GCGCACCAGC TTCTGGTCCG TCTGCTGGAA CGGGATTACG GCAACCAGGA CGAGCGCGAC
CGTCACCGCC GCAGGCTGTT CGTTTCGACC GTTCTCGAAC CCACCGCGCT GCGGGCGCTG
AGCCTCTACG AACAGGTGGA CGACCTGACG CAGGGGGGAA TCGGGCGATT CGCGCTGGGT
GACGACACGG TGGCCCGTAG GGATGACGAC CATCGGAAAC GCTGTGCCGC TTTCCTGCGC
GAGCAGGTCG ATACCCATAT GCGTGGCCTG AACATCGGCG ACGGCCCGCC CGATTTTGAC
CTGTTCGGCG TGGTGGTCGA GGTCGAGGTG GAAACGATCG AGAAGGCGAT CCGCGCGACG
CTGGATACGG TTCTGGGCAG CCTCACCGAA CTGGTCTGGG CCTATGGATG CGATGTCCTG
CTGTTGTCGG GGCGCCCGTC GAAGCTGCGC CGGGTGAGGA ACCTGGTCAC GGGCGCGATG
CCCGTCGCCC CGCACCGGAT CGTCGCGATG CACGATTACA CGGTGGGCCG GCACTATCCC
TTCACGGATT CCGCGGGCCG GATCAGCGAC CCCAAGACGA CCGTCGTCGT CGGTGCGGCG
CTGTGCATCA AGGCCGAGGG AAAGCTCCAG AATTTTGTCC TGAGAACGGG GAAGCTGGTC
ATGCGCTCGA CAGCGCGGTT CCTAGGCAGG ATGGAGCGGA CCGGAATGAT CCGCAAAGCC
GATATTCTGC TGGAGAACGT CGATCTGGAC GATCGTCCCA GCGACGAGGA TATCCGCTTC
CCGCTGGGGG ATTTCGAAGG AATGACCATG ATCGGTTTCC GCCAGCTCCC GCTGGAGCGG
TGGACGACGA CGCCTCTTTA TTGCGCGGAG TTCGCTGCTG GCACGCGCGA GGAAACGCAG
CGCCTGGCCA TGCCGCTGAC CATCGAATTC GACCGGAAAC CCGATATCGA GGACGCCGAG
GAAAAGGGTG ATTTCTCGGG ACGCGAGGAT TTTAGCGTCA GCGAGATTAC CGATGCCGAC
GGGCAGCCTG TTCGTCGCAG CCTGATCATC CTGCGCTTGC AGACCACGCT GAACCCCGAA
GGCTACTGGC GCGATACCGG GTGCCTGACG CTGGGAGCGA CCCTGCCGTG A
 
Protein sequence
MPGLISLIPD SLIQFETITL DAAECGRIRA NFWEESTGIN GDGAPEFILR ALEEDPESRN 
MVFHADGRQR VAPDDAGYVI RRKDIFEVFE NEWVPLPYLA RIGSTDNEGF RPGPANWVRG
RLRRTETREG EEAILLNIAF DTTVAPHFDG RYLAPTPDDE ARGQKFAFVA DLESISWLID
SSWLKDWLEE LQREAALAVP GRKRPAPEEE EKQPVLKHLA TYLAFLRMVA QAGAPPIAQI
LTVTNEPPVA VDLVLDLGNF RSCGILIEEH PGTRQREADS YVLELRDLAQ PELAYGDPFS
SRIEFGRATF GRDSHSRRSG RPHAFVWQSP VRTGPEAERM MGARIGNEGL SGLSAPKRYL
WDTRPSDQGW RFNNGLNRDG QPSDPAVNGP FRRAIQETLG AQRRFRVSLD DVKAAQPVGP
SVLSRSLLFM LMLSELIMQA IMYMNSPGFR SKRRDSARPR LLRSVMLTMP PGMPVAEQRI
FRARAEAAIA FVWSVTGRTG KPPTLRAELD EATATQIVWL HNEVTERLGG HVEALFDLYG
DGRVGEDGRP AVRVASIDIG GGTTDLMITT YTHIGGDALS PHQDFRESFK IAGDDLLERV
ILTVVLPPLE KALTDAGIAE AHQLLVRLLE RDYGNQDERD RHRRRLFVST VLEPTALRAL
SLYEQVDDLT QGGIGRFALG DDTVARRDDD HRKRCAAFLR EQVDTHMRGL NIGDGPPDFD
LFGVVVEVEV ETIEKAIRAT LDTVLGSLTE LVWAYGCDVL LLSGRPSKLR RVRNLVTGAM
PVAPHRIVAM HDYTVGRHYP FTDSAGRISD PKTTVVVGAA LCIKAEGKLQ NFVLRTGKLV
MRSTARFLGR MERTGMIRKA DILLENVDLD DRPSDEDIRF PLGDFEGMTM IGFRQLPLER
WTTTPLYCAE FAAGTREETQ RLAMPLTIEF DRKPDIEDAE EKGDFSGRED FSVSEITDAD
GQPVRRSLII LRLQTTLNPE GYWRDTGCLT LGATLP