Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1888 |
Symbol | |
ID | 6975311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2104412 |
End bp | 2106070 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643391414 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002276263 |
Protein GI | 209544034 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0174434 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGGC TGACCTTGCG GCATTACGTC ATGCTGGCGT TGTGTACGTT CATGATCTTC CTGCCCGGGC GGGCCAGCCT GCCGCCCCTG GACCGGGACG AGCCGCGCTA TATGGAAGCC AGCGCGCAGA TGCTGCGCAG CGGCAATTTC ATCGATGTCC GGTTCCAGGA CCAGCCGCGC TATCTGCAGC CCGCCGGCAT CTACTGGCTG GAGGCCGCCT CGGTCGCCGC CACCGGCACC CTGCGCCAGC ATGCGGTGTG GGCCTATCGC ATTCCGTCGC TGCTGGCGGT GACGGCCGTG GTGGTGCTGA CGGCCTGGAT CGGCGCCACC CTGTTCGGTC CGGCCAGCGG TCTGCTGGCG GCGGGACTGC TCGCCGTGTC GGTGCTGACG ACGGCCGAGG GCCGGATGGC CACCATCGAC ACCACCCTGC TGCTGGCGGT GCTGCTGGCG CAGACGGGGC TCCTGCGCGC CTATCTGGAC CGTGAACGCG ACCGGCCGAC GCCGCTTTCG GCCGCGCTGC TGTACTGGAC GGCGCTGGGG GTGGGGCTGA TGCTCAAGGG GCCGGTGGTG CTGATCCCGG GCTTCGGCAC GCCGCTGGCG CTGGCGCTGG TGGAACGGCG CATCGACTGG TGGCACCGGC TGCGGCCCGC GTGGGGCTGG GCGGTGATGC TGGCGATCGT CCTGCCCTGG TGCGTCGCGA TCGGGGTCGT CAGCCATGGC GATTTCTTCT CGCGCGCGGT GGGAACCAAT TTCCTCGGCA AGGTGGCCCA TGGCCAGCAG GCGCACGGCC TGCCGCCCGG GTATCACCTG CTGGCGTTCG CCATCGCCTT CTGGCCCGGC TCGATCTTCG CGGCGATGGC GCTGCCCTTC GTCTGGGCCC GGCGGCATGC GCCGCCGGTG CGTTTCCTGC TGTGCTGGAT CGTGCCGCAC TGGCTGGTGT TCGAAGCCAT CGCGACCAAG CTGCCGCATT ACGTGCTGCC CACCTATCCG GCGATCGCGA TGCTGACCGC GGCGGCGATC ATGACCATGC CCGACCGCTG GTCATGGCCG GCCGCGCTGT GGGGCCGGGT GGTGCTGGCG GTGTACGGCG TGCTGTGGCT GGTGCTGGGG GTCGCGCTGT CCGTGGCGGG GCCTGTGCTG CTGTGGCGGC TGGAGCATCG GGTGGAGCCC GCGGCGCTGA TCGTGCCGCT GGGCGCGTTG CCGCTGGTGC TGGTGTCCGC CTGGCTGCTG GTGGGGCGCC AGCCCCTGCG GGCCGCCATG GCGGCGGTCG CGGCGGCGGT AATCATCCAT GTCGGCCTGT TCGTGACCGT GATCCCGAAC CTGCAGGCGA TCTGGCTCAG TCCGCGCCTG GCCGCGCTGG TGGACGATTA CCGGCCGTGC CCGGATACGA TCGTGGCCTC GCCCTCGTTC TCGGAACCCA GCCTGGTGTT CCTGGTGGGG CAGAATACGG CGCTGGTCGA TCCCGTTGCC GCGGCCGACC TGCTGCGCGA CAACCGGGCC TGCGGCCTGG CGCTGGTGGA CCGCCGCGAC GAACCGGCCT TTCGCGCGCG CCTGCGGCGG GACGGCCTGA ACGTGATCGA ATTCGGCCGC GTCGCGGGGC TGAATTATTC GACGGGCAAG CATCTCGATA TCGGGCTGTT TGGACCGACA CCCCCATAA
|
Protein sequence | MTRLTLRHYV MLALCTFMIF LPGRASLPPL DRDEPRYMEA SAQMLRSGNF IDVRFQDQPR YLQPAGIYWL EAASVAATGT LRQHAVWAYR IPSLLAVTAV VVLTAWIGAT LFGPASGLLA AGLLAVSVLT TAEGRMATID TTLLLAVLLA QTGLLRAYLD RERDRPTPLS AALLYWTALG VGLMLKGPVV LIPGFGTPLA LALVERRIDW WHRLRPAWGW AVMLAIVLPW CVAIGVVSHG DFFSRAVGTN FLGKVAHGQQ AHGLPPGYHL LAFAIAFWPG SIFAAMALPF VWARRHAPPV RFLLCWIVPH WLVFEAIATK LPHYVLPTYP AIAMLTAAAI MTMPDRWSWP AALWGRVVLA VYGVLWLVLG VALSVAGPVL LWRLEHRVEP AALIVPLGAL PLVLVSAWLL VGRQPLRAAM AAVAAAVIIH VGLFVTVIPN LQAIWLSPRL AALVDDYRPC PDTIVASPSF SEPSLVFLVG QNTALVDPVA AADLLRDNRA CGLALVDRRD EPAFRARLRR DGLNVIEFGR VAGLNYSTGK HLDIGLFGPT PP
|
| |