Gene Gdia_0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0476 
Symbol 
ID6973871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp521652 
End bp522797 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content72% 
IMG OID643390009 
ProducttRNA synthetase class II (G H P and S) 
Protein accessionYP_002274887 
Protein GI209542658 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3705] ATP phosphoribosyltransferase involved in histidine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00108226 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGGACG ACCCGCCTCC CAATCCCGCC CTGCTGCCCG CGGGCTTTGT CGACCTGCTT 
CCTTCCGATG CCGAGGCCGA GGCCGCCGGC ATGGAAGACC TGATGCGGGT CTTCGCGACG
CACGGATACA GCCGCGTGAA GCCGCCGCTG CTGGAATTCG AGGATACGCT GCTGAGCGGA
TCGGGCGCGG CCGTGGCCGA CCAGGCCTTT CGCCTGATGG ACCCGGACAC GCGGCGGATG
ATGGCGCTGC GTCCCGACAT CACGCCGCAG ATCGCGCGGA TCGCGGCGAC CCGGCTGGCC
GATGTGCCGC GTCCGCTGCG CCTGTCCTAT ACCGGGCTGT GCGTGCTGGC CAGCGGCGGG
GCGCGGGAAA GCGACCGCCA GATTTCCCAG GCCGGGATCG AACTGATCGG CCCGGACTCG
CCCCAGGCCG ACGCTGAAAT CATCGCACTG GGGGCCGAAG GCCTGGCGGT GCTGGGGGTG
CCCGGCGTGT CGTTCGACCT GACGATGCCC TCGCTGGCGC CGGCGCTGAT CGCGGAAATC
GACTACACCC CGGCCGAGCG GCAGGCGCTG ATGCGCGCCC TGGACCGCAA GGACGCGGCG
GCCGTCGCGC GCCTGGCCGG CGGCCTGGCG CCGGTGCTGA CCGAATTGCT GCATGCGGCG
GGTCCTGCCG AACGCGCGCT GGCGGTGCTG GGTGGGCTGG TCCTGCCGGA ACCGGTCCGG
GCGCTGAGCG ACCGGCTGGC CGCCACCTGT GCCGCGATCA AGGCGCGCGT GCCCGATATC
CGCCTGACGG TCGATCCGGT CGAATTCCGG GGCTGGCGCT ACCATACCGG CGTGTGCGTG
ACCGTGTACG CCCGTGGCCA GCATGAAGAA CTGGGCCGGG GCGGGCGCTA TATTTCCAAC
AATGACGAAC CGGCCTGCGG CCTGACCCTG CGCCCCGAGG CCCTGCTGCG GGCCGCCCCG
ACCCGGCCCG GACGCATCCG CGTGTTCCTG CCCGCCGGGT GCGACCCGGC GCTGGGCCGG
CGGCTGCGGA CCGAAGGCTA TGCCACGGTC GACGCCCTGG CACCGGTCGC CGACCAGGCG
GCCGAGGCAA GGCGCCTGGG TTGCACGATG ATCGCGGCCG GCGACGGAAT CGTGGCGTTA
GACTGA
 
Protein sequence
MTDDPPPNPA LLPAGFVDLL PSDAEAEAAG MEDLMRVFAT HGYSRVKPPL LEFEDTLLSG 
SGAAVADQAF RLMDPDTRRM MALRPDITPQ IARIAATRLA DVPRPLRLSY TGLCVLASGG
ARESDRQISQ AGIELIGPDS PQADAEIIAL GAEGLAVLGV PGVSFDLTMP SLAPALIAEI
DYTPAERQAL MRALDRKDAA AVARLAGGLA PVLTELLHAA GPAERALAVL GGLVLPEPVR
ALSDRLAATC AAIKARVPDI RLTVDPVEFR GWRYHTGVCV TVYARGQHEE LGRGGRYISN
NDEPACGLTL RPEALLRAAP TRPGRIRVFL PAGCDPALGR RLRTEGYATV DALAPVADQA
AEARRLGCTM IAAGDGIVAL D