Gene Gdia_2196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2196 
Symbol 
ID6975624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2434853 
End bp2436799 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content68% 
IMG OID643391725 
Productsulfotransferase 
Protein accessionYP_002276569 
Protein GI209544340 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCATA CAGACCCACG CTCGGCCGTG GCGCCGCCTT TCCCTCCGGC GGTGGAAGAA 
GGCGGGCAGA TCGACGCCAT CGCCCGCCAG TGCGAACACA TCCTGGACCA GGAGCCCGAT
CACCCCGGCG CGTCCTGTCT TCTGGGAACG ATCCACGCCC GGCAGGGAAA ATTCGAAAGC
GCCATACCGC TGTTGCGGCG CGCCCTGGCG CGGATGCCGG CGAATGCCGA AGGATACAAC
GTTCTCGGCA TGGCGTTGCG CGATGCCGGA CAGGCCGAAG ACGCGATCGC CTGCTTTCGC
AGGGCGGTCG CCATCCGGCC GGACCATCAG GGCGCGCGCA CCAACCTGGG CAATGCCCTG
GTGGCCGGCG GCGACCGCGC GGGCGCCATC GCGCAGTTTC GCGCGCTCCT GACGCTCGAC
ACCCAACTGG CCGCCATCGC GGACTATCGC ACGGCCCTGG CGGCCGATCC GGCGGACGTC
GAAACCCTCA TCAGGCTGGG CGCGGCGCTT CGGACCATCG GACGGTGCGA GGAGGCGGCC
GCGCACTTCC AGGCGGCATC GAGCCATGCC CCCGACCGCG TCGCGGCCCG GCTGCACCGC
GCCGGCGCCC TGGCCGAACT CGGCCGCATC GATGACGCGA TGGCCTGCTA CCAGTCGGTC
CTGGACCGGG ACGCGAACAA TTATACCGTG CTGCTCATGA TGGGGGAGCT GCTCCAGAAA
AACGAACGCT ATGCCGAAGC GATCCGGTAT CTGGAGCAGG CCCGCGCATT GCAGCCCGAT
GCGGCGTCGG TCCATGCCGG CCTGGGCGTG TCGTTGCAGG TCATCGGACA GATCGCCGCC
GCCGCCGCGT GTTTTCGCCG CGCGATCGCC CTGGCGCCCG ACCGCCTGGC GGTTTACCTG
GCCCTGACCC GGATCGAGAA ACTGACCGCC GACGATCCCA TCCTGACCGC CCTGCAGGAG
CGTGCCGGAA ACGAGGCCGC GCTGACCGAC GGCGAAAGGA TCGACATTCA TTTCGCCCTC
GGCAAGGCGC TGTCCGACAT CGGCCGGCAT CGGGAATCGT TCGATCATTT CCTGAAGGGA
AACGCCCTGC GGCGACGCGA GATCGTCTAT GACGAGAACA GGATGGTCGC GGCGCTGCGC
CGGACGCGCG AGGAATTCTC GGCCGGGGCG ATCGCGGACC TGGCCCGGAC GGGGCACCCT
TCGGCCCGCC CCATCTTCAT CGTCGGCATG CCGCGATCGG GATCGACGCT GGTCGAACAG
ATCCTGGCCA GCCATCCCGA TGTCCACGGC GCGGGCGAAG TCACCACGCT GGCCGATACG
TTCAAGGACG CCATGGAACG CTTCCCCGCA TGGCGGACGA TCGCGCCGCT GGCCGCCCTG
ACGGAGGCCG AGCGCCTGTC GGTCGCCGAG GACTATCTGC GGCGGCTGGA CGCGCTGGTC
CCGGACGGGG CGGGGGCGAC GGCGCGTGTT ACGAACAAGA CATTGGGCAA TTATTTCTTT
ATCGGACTGA TTCGCCAGCT CTGGCCCCAT GCGTCGATCA TCCACACAGT CCGCGACCCG
ATCGATACCT GCCTGTCGTG CTTTTCGATT CCGTTCGCGG CACAGGATTT TTCCTTCGAC
CTGGGGGAGC TTGGCCGCCG CTATCGGTGC TATCGGGACA TGATGGACCA CTGGCGGCAG
GTCCTGCCGG CCGGGGCGAT GCTGGATGTG CGCTACGAGG ACGTGGTCGC CGACCTTGAA
GGCAGCGCGC GCCGGATCGT CGCCTATTGC GGCCTGCCCT GGGACGATGC CTGCCTGCGG
TTCCACGAGA CCCGGCGGCC GGTGAAGACG TCGAGCATGG AACAGGTCAG GAAACCGATC
TATCGCAGCG CCGTCGGCCG CTGGCGGCCC GACGATGCGA CGTTGCGGCC CCTGCTGGAT
GGGCTTGGCG CCCATTTTGC CCCATAA
 
Protein sequence
MPHTDPRSAV APPFPPAVEE GGQIDAIARQ CEHILDQEPD HPGASCLLGT IHARQGKFES 
AIPLLRRALA RMPANAEGYN VLGMALRDAG QAEDAIACFR RAVAIRPDHQ GARTNLGNAL
VAGGDRAGAI AQFRALLTLD TQLAAIADYR TALAADPADV ETLIRLGAAL RTIGRCEEAA
AHFQAASSHA PDRVAARLHR AGALAELGRI DDAMACYQSV LDRDANNYTV LLMMGELLQK
NERYAEAIRY LEQARALQPD AASVHAGLGV SLQVIGQIAA AAACFRRAIA LAPDRLAVYL
ALTRIEKLTA DDPILTALQE RAGNEAALTD GERIDIHFAL GKALSDIGRH RESFDHFLKG
NALRRREIVY DENRMVAALR RTREEFSAGA IADLARTGHP SARPIFIVGM PRSGSTLVEQ
ILASHPDVHG AGEVTTLADT FKDAMERFPA WRTIAPLAAL TEAERLSVAE DYLRRLDALV
PDGAGATARV TNKTLGNYFF IGLIRQLWPH ASIIHTVRDP IDTCLSCFSI PFAAQDFSFD
LGELGRRYRC YRDMMDHWRQ VLPAGAMLDV RYEDVVADLE GSARRIVAYC GLPWDDACLR
FHETRRPVKT SSMEQVRKPI YRSAVGRWRP DDATLRPLLD GLGAHFAP