Gene Gdia_3375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3375 
Symbol 
ID6976821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3697728 
End bp3699026 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content67% 
IMG OID643392891 
ProductO-antigen polymerase 
Protein accessionYP_002277716 
Protein GI209545487 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.221665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.11504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGG AAAAAGCGAC CTGGCGCGGT GCGGCATTCG ATTCGGTCGC GCGATGGATC 
GCCCTGGCGA TGATGATGGT CATGCCGTTT TTCCAGTTGC GGGGACGGGC GATCAGCGAC
GGCATCATGT CGGCCATCGG CCTGCTGTTC CTCGTGCATT GCATCCGCAC GCGCGCGAAT
GACTGGTGGC GCCGGGGATG GTTCGCCTTC GCCCTGCTGT TCTGCGGGCT CGCGATCCTG
TCGTCGGCCC TGCATGGATC GTCCCATGCG GTGGCCGAGG CCTGCGTCCT GATCCGGTTC
TTCCTGTTCA GCGCGGCGCT GGAACGCTGG GTGCTGCGCG ACGCGGCCTC ACGGCGCTGG
CTGGGGGCCG TCGTGACTGT GGCCGCGCTT TGGCTGGTGG TGGAATGCTG GCAGCAATAT
CTGCTGGGCT ACAACATCTG GGGCTTCCCG CGCTGGCCGG ACGGGGCGCT GACGGGGCCG
TTCTATAAAC CCCGCGCGGG CGCCGCGCTG CTGATGGTCG TGTTCCCCGG CCTGATGCCC
TTCGCCCTGC GCCGGCTGCA GGCAGCGTCC TGGCGGCCGA AACTGGCGGG CATCGCCCTT
ATCATGTTCC TGGTGGTGAC GATGCTGCTG ATCGGGCAGC GCATGCCCAC CCTGCTGTTC
GGGCTGGGGC TGGTGCTGAC GGCACTGTTC GTTCCCTCGA CCCGCTGGGC GGTCTTCGCG
GCCGGAATGG CCGGCGTGGT GGGGCTGTTC CTGCTGCCGA TCCTCTCGCC GCCGGCCTAT
GCCAAGCTGG TGGTGCATTT CCTGGCGCAG ATCCGCGATT TTCCCGACAG CGATTACGGC
CATATCTATA TTCGCGCGGC CGCCATGGTG CGCCAGCATC CATGGCTGGG GCTGGGCGCG
GACGGGTTTC GCGATTTCTG TCCGAATCCG TCCTTTGCCC GCGACCTGTC GCTGTTCGGA
TACGATTTCC ACGTCCCCGT CGGCGCCGGC TGCAACATTC ATCCCCATAA TATCTATCTG
GAGGTCGCGA CCACGGCGGG GCTGCCCGGC CTGGCCTGTT TCGTCGCGAT GGCGGCGGCC
TGGCTGTGGC GGATGCTGCG GGCCCTTTCG CCCGTCGAGG CCCCGCAGCA GGCCATGCTG
TGCGTGATCT GCTGCGTGAT CTTGTGGCCG GTGGCCTCGA ACAGCGCGCT GTTCACGGTG
CGGACGGCGG GGTGGTTCTT CCTGATGGTC GGATGGGGGC TGGCGGCGTC GCGCGACGTG
GCGGGCGAAC GGCGGCTGAA TGCAGGGCGG CGGGCCTAG
 
Protein sequence
MTMEKATWRG AAFDSVARWI ALAMMMVMPF FQLRGRAISD GIMSAIGLLF LVHCIRTRAN 
DWWRRGWFAF ALLFCGLAIL SSALHGSSHA VAEACVLIRF FLFSAALERW VLRDAASRRW
LGAVVTVAAL WLVVECWQQY LLGYNIWGFP RWPDGALTGP FYKPRAGAAL LMVVFPGLMP
FALRRLQAAS WRPKLAGIAL IMFLVVTMLL IGQRMPTLLF GLGLVLTALF VPSTRWAVFA
AGMAGVVGLF LLPILSPPAY AKLVVHFLAQ IRDFPDSDYG HIYIRAAAMV RQHPWLGLGA
DGFRDFCPNP SFARDLSLFG YDFHVPVGAG CNIHPHNIYL EVATTAGLPG LACFVAMAAA
WLWRMLRALS PVEAPQQAML CVICCVILWP VASNSALFTV RTAGWFFLMV GWGLAASRDV
AGERRLNAGR RA