Gene Gdia_0071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0071 
Symbol 
ID6973460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp80561 
End bp82462 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content70% 
IMG OID643389604 
Productsurface antigen (D15) 
Protein accessionYP_002274488 
Protein GI209542259 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0729] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.309783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.034078 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGCTGG TCGTCTTCGT GGGCTGGATG CCGCCGCCCG TTCGCGGCGC GGACCCGCAA 
TCCTACGTCA CCGTGATCCG CCCGACCGGC CAGGGCGACC TGGACGCGGC GATCAGCGCG
TCCTCCAGCC TGCTGTCCCT GCAGAAGACC AAGGCCGTCA GCCCGTTCGC GCTCGCCGGC
CGCATCCGCA ACGATTACGA CCGCCTGCGC ACCGCCCTGG AAAGCTACGG CTACTACGCC
GCGACGATCC GCATCGCCGT CGGACTGCGC GCCGGGGGGC ACGCCGCGCC GTCGGCGCCG
CCGGCGACGA TGGATGGCCA GGACCCGCGC CTGCCGGAAT GGCTGCTGGC GGTTCCCAAG
GGGCAGGCCG TGCAGGTCAC CATCACGCCG GTCCGGGGCG ACATCTTCCA TCTGGGGCAG
GTGACGCTGA AGCCGTCGCC GGAGGACGGC ACGGCCCCGA TCGTCCTCAA CGCCCCGGAA
CGCACCGCCC TGGGCGTGGC CTCGGGCCAT CCGGCCATCG CGTCCGACGT GCTGGCGGGC
GGGGTGAACC TGCAGGCGGA ACTGAAGGAG GAAGGCCACG CCCTGGCGCA GGTCGGCACG
CCCAAGGCCT GGCTGCGGCC CCAGACCCAT ACGCTGGACG TCGAATACAC CGTGCGGCGC
GGGCCGATCG TGACGATCGG CGCCATCGCG CTGTCCGGGC TGAAGCGGAC CCATCCCGCC
TATATCGCGC GGCGGATCAC CCTGCACCCC GACCAGCTTT ACCAGCCGTC ACACATCGAG
GCGGCGCGGC AGGACCTGGC GTCGCTCGGC GTGTTTTCCG ACGTGCAGGC CAGCGACGCG
CCGCCGCTGA CGGCTGGCCG GCAAATGCCG CTGAACTTCG CCTTCACCGA GGGCAAGCAG
CGGATGGCGG AGGTGGAGGG CGGATATTCC ACCGACCTGG GCGGCCGGGG CGGCGTAAGC
TGGACGCACA ACAACATCTT CGGCAATGCC GAGCGCCTGC GCCTGACCAC CCTGGTGACG
GGACTGGGCG GTTCGGCGCA GCAGGGGCTG GGCTATGACG TATATGCCGA CCTGCTGAAG
CCGGATTTCG GTGACCGCGA CCAGAACCTG AGCGTGCGGG TCGAGGGAAT CCGCCAGTTG
CTCTATTCCT ACCGGCAGAC GGCGCTGCTG GTCCGCGCGG GCATCGTCCG CCATCTGGGG
CGGCGATGGA CGGTGTCTTT CGGCGGCGAG GCCGAACAGG AACATATCGA ACAGATGGGG
ATGTCCAACG ACTACACCAT CGTGTCCCTG CCCCTGTCCG CGACCTATGA CAGCACGGGG
CTGACCAACC CGATCGACCC CGCGACCCAC GGGGTGCGCA TCGCCGCCAG CGCGACGCCC
TCGGCCTCCC TGATCAGCGG CACGTCGTTC TTCACCATCC TGCAGGCGAC GGCATCCACC
TATTTCGACC TGTCGCATGT GGGCCTTTCG CGGCCCGGGC GCAGCGTCTT CGCGTTTCGC
GGCGTCGTCG GCAGCGTGCA GGGGGCTTCG ACGTTCGAGA TTCCGCCTGA TCAACGCCTG
TATGCCGGCG GCAGCGCGAC CGTGCGCGGC TTCCGCTACC AGGGCGTGGG GCCGCAATTT
CCCAACAGCA AATACGCGAT CGGCGGCACG TCGATGGATG CGGGCACCGT GGAATTCCGC
CAGCGCCTGT TCCGCAGCTT CGGCGCGGCG CTGTTCGCCG ATGCCGGCCA GGTCGACACC
GGCAGCAGCC CCCTGCATGG CACGCTGCGC GTCGGCGCAG GGGCAGGGGT GCGGTACTAT
ACGCCGATCG GCCCGGTGCG GGTGGACGTC GCGTTCCCGC TGAACCGGCC GGCGCAAGGC
GATACGTGGG AACTCTATAT CGGCCTGGGG GAAACCTTCT GA
 
Protein sequence
MGLVVFVGWM PPPVRGADPQ SYVTVIRPTG QGDLDAAISA SSSLLSLQKT KAVSPFALAG 
RIRNDYDRLR TALESYGYYA ATIRIAVGLR AGGHAAPSAP PATMDGQDPR LPEWLLAVPK
GQAVQVTITP VRGDIFHLGQ VTLKPSPEDG TAPIVLNAPE RTALGVASGH PAIASDVLAG
GVNLQAELKE EGHALAQVGT PKAWLRPQTH TLDVEYTVRR GPIVTIGAIA LSGLKRTHPA
YIARRITLHP DQLYQPSHIE AARQDLASLG VFSDVQASDA PPLTAGRQMP LNFAFTEGKQ
RMAEVEGGYS TDLGGRGGVS WTHNNIFGNA ERLRLTTLVT GLGGSAQQGL GYDVYADLLK
PDFGDRDQNL SVRVEGIRQL LYSYRQTALL VRAGIVRHLG RRWTVSFGGE AEQEHIEQMG
MSNDYTIVSL PLSATYDSTG LTNPIDPATH GVRIAASATP SASLISGTSF FTILQATAST
YFDLSHVGLS RPGRSVFAFR GVVGSVQGAS TFEIPPDQRL YAGGSATVRG FRYQGVGPQF
PNSKYAIGGT SMDAGTVEFR QRLFRSFGAA LFADAGQVDT GSSPLHGTLR VGAGAGVRYY
TPIGPVRVDV AFPLNRPAQG DTWELYIGLG ETF