Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2196 |
Symbol | |
ID | 6975624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2434853 |
End bp | 2436799 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643391725 |
Product | sulfotransferase |
Protein accession | YP_002276569 |
Protein GI | 209544340 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4235] Cytochrome c biogenesis factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCATA CAGACCCACG CTCGGCCGTG GCGCCGCCTT TCCCTCCGGC GGTGGAAGAA GGCGGGCAGA TCGACGCCAT CGCCCGCCAG TGCGAACACA TCCTGGACCA GGAGCCCGAT CACCCCGGCG CGTCCTGTCT TCTGGGAACG ATCCACGCCC GGCAGGGAAA ATTCGAAAGC GCCATACCGC TGTTGCGGCG CGCCCTGGCG CGGATGCCGG CGAATGCCGA AGGATACAAC GTTCTCGGCA TGGCGTTGCG CGATGCCGGA CAGGCCGAAG ACGCGATCGC CTGCTTTCGC AGGGCGGTCG CCATCCGGCC GGACCATCAG GGCGCGCGCA CCAACCTGGG CAATGCCCTG GTGGCCGGCG GCGACCGCGC GGGCGCCATC GCGCAGTTTC GCGCGCTCCT GACGCTCGAC ACCCAACTGG CCGCCATCGC GGACTATCGC ACGGCCCTGG CGGCCGATCC GGCGGACGTC GAAACCCTCA TCAGGCTGGG CGCGGCGCTT CGGACCATCG GACGGTGCGA GGAGGCGGCC GCGCACTTCC AGGCGGCATC GAGCCATGCC CCCGACCGCG TCGCGGCCCG GCTGCACCGC GCCGGCGCCC TGGCCGAACT CGGCCGCATC GATGACGCGA TGGCCTGCTA CCAGTCGGTC CTGGACCGGG ACGCGAACAA TTATACCGTG CTGCTCATGA TGGGGGAGCT GCTCCAGAAA AACGAACGCT ATGCCGAAGC GATCCGGTAT CTGGAGCAGG CCCGCGCATT GCAGCCCGAT GCGGCGTCGG TCCATGCCGG CCTGGGCGTG TCGTTGCAGG TCATCGGACA GATCGCCGCC GCCGCCGCGT GTTTTCGCCG CGCGATCGCC CTGGCGCCCG ACCGCCTGGC GGTTTACCTG GCCCTGACCC GGATCGAGAA ACTGACCGCC GACGATCCCA TCCTGACCGC CCTGCAGGAG CGTGCCGGAA ACGAGGCCGC GCTGACCGAC GGCGAAAGGA TCGACATTCA TTTCGCCCTC GGCAAGGCGC TGTCCGACAT CGGCCGGCAT CGGGAATCGT TCGATCATTT CCTGAAGGGA AACGCCCTGC GGCGACGCGA GATCGTCTAT GACGAGAACA GGATGGTCGC GGCGCTGCGC CGGACGCGCG AGGAATTCTC GGCCGGGGCG ATCGCGGACC TGGCCCGGAC GGGGCACCCT TCGGCCCGCC CCATCTTCAT CGTCGGCATG CCGCGATCGG GATCGACGCT GGTCGAACAG ATCCTGGCCA GCCATCCCGA TGTCCACGGC GCGGGCGAAG TCACCACGCT GGCCGATACG TTCAAGGACG CCATGGAACG CTTCCCCGCA TGGCGGACGA TCGCGCCGCT GGCCGCCCTG ACGGAGGCCG AGCGCCTGTC GGTCGCCGAG GACTATCTGC GGCGGCTGGA CGCGCTGGTC CCGGACGGGG CGGGGGCGAC GGCGCGTGTT ACGAACAAGA CATTGGGCAA TTATTTCTTT ATCGGACTGA TTCGCCAGCT CTGGCCCCAT GCGTCGATCA TCCACACAGT CCGCGACCCG ATCGATACCT GCCTGTCGTG CTTTTCGATT CCGTTCGCGG CACAGGATTT TTCCTTCGAC CTGGGGGAGC TTGGCCGCCG CTATCGGTGC TATCGGGACA TGATGGACCA CTGGCGGCAG GTCCTGCCGG CCGGGGCGAT GCTGGATGTG CGCTACGAGG ACGTGGTCGC CGACCTTGAA GGCAGCGCGC GCCGGATCGT CGCCTATTGC GGCCTGCCCT GGGACGATGC CTGCCTGCGG TTCCACGAGA CCCGGCGGCC GGTGAAGACG TCGAGCATGG AACAGGTCAG GAAACCGATC TATCGCAGCG CCGTCGGCCG CTGGCGGCCC GACGATGCGA CGTTGCGGCC CCTGCTGGAT GGGCTTGGCG CCCATTTTGC CCCATAA
|
Protein sequence | MPHTDPRSAV APPFPPAVEE GGQIDAIARQ CEHILDQEPD HPGASCLLGT IHARQGKFES AIPLLRRALA RMPANAEGYN VLGMALRDAG QAEDAIACFR RAVAIRPDHQ GARTNLGNAL VAGGDRAGAI AQFRALLTLD TQLAAIADYR TALAADPADV ETLIRLGAAL RTIGRCEEAA AHFQAASSHA PDRVAARLHR AGALAELGRI DDAMACYQSV LDRDANNYTV LLMMGELLQK NERYAEAIRY LEQARALQPD AASVHAGLGV SLQVIGQIAA AAACFRRAIA LAPDRLAVYL ALTRIEKLTA DDPILTALQE RAGNEAALTD GERIDIHFAL GKALSDIGRH RESFDHFLKG NALRRREIVY DENRMVAALR RTREEFSAGA IADLARTGHP SARPIFIVGM PRSGSTLVEQ ILASHPDVHG AGEVTTLADT FKDAMERFPA WRTIAPLAAL TEAERLSVAE DYLRRLDALV PDGAGATARV TNKTLGNYFF IGLIRQLWPH ASIIHTVRDP IDTCLSCFSI PFAAQDFSFD LGELGRRYRC YRDMMDHWRQ VLPAGAMLDV RYEDVVADLE GSARRIVAYC GLPWDDACLR FHETRRPVKT SSMEQVRKPI YRSAVGRWRP DDATLRPLLD GLGAHFAP
|
| |