Gene Gdia_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1888 
Symbol 
ID6975311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2104412 
End bp2106070 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content70% 
IMG OID643391414 
Productglycosyl transferase family 39 
Protein accessionYP_002276263 
Protein GI209544034 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0174434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGGC TGACCTTGCG GCATTACGTC ATGCTGGCGT TGTGTACGTT CATGATCTTC 
CTGCCCGGGC GGGCCAGCCT GCCGCCCCTG GACCGGGACG AGCCGCGCTA TATGGAAGCC
AGCGCGCAGA TGCTGCGCAG CGGCAATTTC ATCGATGTCC GGTTCCAGGA CCAGCCGCGC
TATCTGCAGC CCGCCGGCAT CTACTGGCTG GAGGCCGCCT CGGTCGCCGC CACCGGCACC
CTGCGCCAGC ATGCGGTGTG GGCCTATCGC ATTCCGTCGC TGCTGGCGGT GACGGCCGTG
GTGGTGCTGA CGGCCTGGAT CGGCGCCACC CTGTTCGGTC CGGCCAGCGG TCTGCTGGCG
GCGGGACTGC TCGCCGTGTC GGTGCTGACG ACGGCCGAGG GCCGGATGGC CACCATCGAC
ACCACCCTGC TGCTGGCGGT GCTGCTGGCG CAGACGGGGC TCCTGCGCGC CTATCTGGAC
CGTGAACGCG ACCGGCCGAC GCCGCTTTCG GCCGCGCTGC TGTACTGGAC GGCGCTGGGG
GTGGGGCTGA TGCTCAAGGG GCCGGTGGTG CTGATCCCGG GCTTCGGCAC GCCGCTGGCG
CTGGCGCTGG TGGAACGGCG CATCGACTGG TGGCACCGGC TGCGGCCCGC GTGGGGCTGG
GCGGTGATGC TGGCGATCGT CCTGCCCTGG TGCGTCGCGA TCGGGGTCGT CAGCCATGGC
GATTTCTTCT CGCGCGCGGT GGGAACCAAT TTCCTCGGCA AGGTGGCCCA TGGCCAGCAG
GCGCACGGCC TGCCGCCCGG GTATCACCTG CTGGCGTTCG CCATCGCCTT CTGGCCCGGC
TCGATCTTCG CGGCGATGGC GCTGCCCTTC GTCTGGGCCC GGCGGCATGC GCCGCCGGTG
CGTTTCCTGC TGTGCTGGAT CGTGCCGCAC TGGCTGGTGT TCGAAGCCAT CGCGACCAAG
CTGCCGCATT ACGTGCTGCC CACCTATCCG GCGATCGCGA TGCTGACCGC GGCGGCGATC
ATGACCATGC CCGACCGCTG GTCATGGCCG GCCGCGCTGT GGGGCCGGGT GGTGCTGGCG
GTGTACGGCG TGCTGTGGCT GGTGCTGGGG GTCGCGCTGT CCGTGGCGGG GCCTGTGCTG
CTGTGGCGGC TGGAGCATCG GGTGGAGCCC GCGGCGCTGA TCGTGCCGCT GGGCGCGTTG
CCGCTGGTGC TGGTGTCCGC CTGGCTGCTG GTGGGGCGCC AGCCCCTGCG GGCCGCCATG
GCGGCGGTCG CGGCGGCGGT AATCATCCAT GTCGGCCTGT TCGTGACCGT GATCCCGAAC
CTGCAGGCGA TCTGGCTCAG TCCGCGCCTG GCCGCGCTGG TGGACGATTA CCGGCCGTGC
CCGGATACGA TCGTGGCCTC GCCCTCGTTC TCGGAACCCA GCCTGGTGTT CCTGGTGGGG
CAGAATACGG CGCTGGTCGA TCCCGTTGCC GCGGCCGACC TGCTGCGCGA CAACCGGGCC
TGCGGCCTGG CGCTGGTGGA CCGCCGCGAC GAACCGGCCT TTCGCGCGCG CCTGCGGCGG
GACGGCCTGA ACGTGATCGA ATTCGGCCGC GTCGCGGGGC TGAATTATTC GACGGGCAAG
CATCTCGATA TCGGGCTGTT TGGACCGACA CCCCCATAA
 
Protein sequence
MTRLTLRHYV MLALCTFMIF LPGRASLPPL DRDEPRYMEA SAQMLRSGNF IDVRFQDQPR 
YLQPAGIYWL EAASVAATGT LRQHAVWAYR IPSLLAVTAV VVLTAWIGAT LFGPASGLLA
AGLLAVSVLT TAEGRMATID TTLLLAVLLA QTGLLRAYLD RERDRPTPLS AALLYWTALG
VGLMLKGPVV LIPGFGTPLA LALVERRIDW WHRLRPAWGW AVMLAIVLPW CVAIGVVSHG
DFFSRAVGTN FLGKVAHGQQ AHGLPPGYHL LAFAIAFWPG SIFAAMALPF VWARRHAPPV
RFLLCWIVPH WLVFEAIATK LPHYVLPTYP AIAMLTAAAI MTMPDRWSWP AALWGRVVLA
VYGVLWLVLG VALSVAGPVL LWRLEHRVEP AALIVPLGAL PLVLVSAWLL VGRQPLRAAM
AAVAAAVIIH VGLFVTVIPN LQAIWLSPRL AALVDDYRPC PDTIVASPSF SEPSLVFLVG
QNTALVDPVA AADLLRDNRA CGLALVDRRD EPAFRARLRR DGLNVIEFGR VAGLNYSTGK
HLDIGLFGPT PP