Gene Gdia_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0041 
Symbol 
ID6973430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp46675 
End bp47964 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content73% 
IMG OID643389574 
ProductFmu (Sun) domain protein 
Protein accessionYP_002274458 
Protein GI209542229 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0290065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0804163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCCA GTGCGCGGCT CGCCGCCGCG ATCGATCTGC TGTCGGCGAT GGAGGCCACG 
CCCCGTCGCC CGGCGGATGC CGTCGCCAAT GCCTTCTTCC GTGAACGACG CTATATCGGT
GGCGGAGACA GGCGGGCGAT ATCGGCGCGT GTCTGGACCG TGCTGCGTCA CTGGCGCCAT
CTGGCCTGGT GGCTGGATCG CGCGGGGGCG GCGGCCACGC CCCGGGCGCG GCTGATCGCG
GCGCTGGCGC TGATGCCCCA GCCCGGCGAG GCGCCGCAGG ACCTGTTCGT GTCCCAGGAC
CGCTACGCGC CGCAGCCCCT GTCGGCGGCG GAACGGGACC TGGCCACCCG GCTGCGCGGC
CAGGCCATGG TCCACCCCGA CATGCCGCGT GCGGTCCGGC TGGAAGTGCC GGACTGGCTG
CTGCCGCGGC TGGAGGAAAC GTTCGGCGCC GACCTGGATG CCGAAGTGGC GGCGCTGGCC
GGCGAGGCGA CGCTGGACCT GCGGGTGAAC CTGCTGAAGA CCACGCGGGC CGAGGCCGCG
CGCCTGCTGG CCGCCGACGG CATCATGGCC GAACCGACGG GCCTGTCCCC CTGGGGCCTG
CGCGTGCCCG GGCGGCAGCC GGTGACGGCC ACGGCGGCCT TCAAGTCCGG ACTGGTGGAA
ATCCAGGACG AAGGCAGCCA GATCGTGGTG GCGGCCGCCG ACGCGCGGCC CGGCATGCGG
GTGCTGGATT ACTGCGCGGG CGCCGCCGGC AAGACGCTGG GCATGGCGAT GACCATGGAA
AATCGCGGCC ATATCGTGGC CTGCGACGTC TCCGAACCCC GGCTGGAGGG CGCGGTGCGC
CGCCTGCGCC GCGCGGGCGT CCATAATGCG GAACGGCACC TGCTGGTCCC GGGCGACCGC
TGGGCCCGGC GGCGGGCCGC CAGCTTCGAC CGGGTGCTGG TCGATGCCCC CTGCACCGGA
ACCGGGACAT GGCGGCGCAA TCCCGACGCG CGGCTGCGCC TGACCGAGCA GGACCTGGCC
GAACTGATGG CCAAGCAGGC CGACATCCTG GCGACCGCCT CCGCCCTGGT GCGGCCGGGC
GGGCGTCTGG TCTATGCCAC CTGTTCGATC CTGCGCGAGG AGAACCAGGA CCGGATCGCG
TCGTTCCTGC GCGCCTCGCC GCATTTCCGC CGGGCGGAAA CGGTGCCTGA CCTGGCGCCC
GATCTGGCAC AGGATGGGAT GATCGCGCTG TCACCCTTGC GGCACGGGAC CGACGGATTC
TTCGCCGCGA TCCTGGAACG CACGGCCTGA
 
Protein sequence
MTPSARLAAA IDLLSAMEAT PRRPADAVAN AFFRERRYIG GGDRRAISAR VWTVLRHWRH 
LAWWLDRAGA AATPRARLIA ALALMPQPGE APQDLFVSQD RYAPQPLSAA ERDLATRLRG
QAMVHPDMPR AVRLEVPDWL LPRLEETFGA DLDAEVAALA GEATLDLRVN LLKTTRAEAA
RLLAADGIMA EPTGLSPWGL RVPGRQPVTA TAAFKSGLVE IQDEGSQIVV AAADARPGMR
VLDYCAGAAG KTLGMAMTME NRGHIVACDV SEPRLEGAVR RLRRAGVHNA ERHLLVPGDR
WARRRAASFD RVLVDAPCTG TGTWRRNPDA RLRLTEQDLA ELMAKQADIL ATASALVRPG
GRLVYATCSI LREENQDRIA SFLRASPHFR RAETVPDLAP DLAQDGMIAL SPLRHGTDGF
FAAILERTA