Gene Gdia_1054 details

Gene Information

Locus tag: Gdia_1054
Symbol:
ID: 6974451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1180468
End bp: 1181844
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 53%
IMG OID: 643390576
Product: transposase IS1182 family protein
Protein accession: YP_002275452
Protein GI: 209543223
COG category: [L] Replication, recombination and repair
COG ID: [COG3666] Transposase and inactivated derivatives
TIGRFAM ID:
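
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions, not something the record states. The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1180468, 1181844)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values match the listed Gene Length (1377 bp) and Protein Length (458 aa).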


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.61591
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGTG ATCGAACGAA AATACAGGAG GCACTGTTCT ACGAGTTCCG TCTTGAAGAT 
CATGTGCCAG CTGGCCACCT TTTGCGTTCA ATTGATCGCT TTGTCGATCT GGACGGCCTG
CGTGAGCATC TTCGCCCATT TTACAGCGGC ACGGGGCGGC CTTCAATCGA TCCCGAGCTG
ATCATCAGGA TGTTAATCGT TGGCTACGTG ATGGGTATTC GATCCGAGCG ACGGCTATGT
GAAGAAGTCC ATCTTAACCT GGCCTACCGG TGGTTCTGTG GTCTTGGTCT CAACGGCCCT
GTGCCTGATC ATTCGACATT CTCGAAAAAC CGGCACGGAC GCTTCCGGGA GAGTGATCTG
CTTCGCCAAA TGTTCGAAAT GACCGTCCGG CAGTGCATCG CCAAGGGACT GGTGGGCGGT
GAAGGCTTTG CGGTCGATGC CAGTACGATC AAGGCCGACG CTAATCGGCA GCGCAGTGTT
CCGAGCCCAG ATAAACTGCC GATCGAGGTG GCCCAACGTG CTGTGCGGGA GTATTTCTCG
GTGCTGGACG ATGCTGCATT TGGGTCTGCT ACGCCCGTGC AACCAAAATA TATCTCTCCG
GTCGATCCTG CTGCACGTTG GAACGCTGCA AGCGGTGGCC TTGCTTATTA TGCCTACTGC
ACAAATTACC TCATTGACCT TAAATCGGCT GTCATCATGG ATGTAGAAAC CACGACAGCC
ATCCGGCAGG CCGAGGTTAC GGCGCAACGC AGAATGATAG AGCGTACGCA GGAAACATTT
GGAATATGGC CCGAAAGGCT TGCTGCGGAT ACAGCTTATG GATCCGCAGA AAATCTTGCG
TGGCTGGTTC ATGAGCGTGG CATAGAACCT CACATTCCGG TCTTCGACAA ATCTGCCCGG
CAGGACGGGA CTTTCGAACG TCGAGATTTC ACATATGACC ACGTGCACGA TCTTTACATC
TGTCCTGGAG GACAGCAACT GAAGCAGCAG TGGCGCAAGA TCAACTCGGA TCAACCAAAT
GCCCCTCCCG ACAACCTACT TCGATACCGT TCGTCGAAAC TGGCGTGCGA CGTATGCACT
CTCAAACCAA AATGCTGCCC TAATCAGCCC AATCGTAAGG TTCTGCGCTC TATTCATGAA
GGCGCTCGTG ATATGGCCCG CGACATTGCT TTAACCGACG CCTATATTAT CTCCAGACGA
GAACGAAAGA AGGTCGAAAT GCTATTTGCT CACCTCAAGC GCATTTTGAA GATCGATCGG
TTGAGGCTCA GAGGACCAAA CGGCGCCCGT GATGAGTTCC ATCTCGCCGC AGCTGCCCAA
AATCTCCGCA AAATGGCGAA ACTGATACCT CCCGGAGTGC CTGCCTTATC CACCTGA
 
Protein sequence
MMGDRTKIQE ALFYEFRLED HVPAGHLLRS IDRFVDLDGL REHLRPFYSG TGRPSIDPEL 
IIRMLIVGYV MGIRSERRLC EEVHLNLAYR WFCGLGLNGP VPDHSTFSKN RHGRFRESDL
LRQMFEMTVR QCIAKGLVGG EGFAVDASTI KADANRQRSV PSPDKLPIEV AQRAVREYFS
VLDDAAFGSA TPVQPKYISP VDPAARWNAA SGGLAYYAYC TNYLIDLKSA VIMDVETTTA
IRQAEVTAQR RMIERTQETF GIWPERLAAD TAYGSAENLA WLVHERGIEP HIPVFDKSAR
QDGTFERRDF TYDHVHDLYI CPGGQQLKQQ WRKINSDQPN APPDNLLRYR SSKLACDVCT
LKPKCCPNQP NRKVLRSIHE GARDMARDIA LTDAYIISRR ERKKVEMLFA HLKRILKIDR
LRLRGPNGAR DEFHLAAAAQ NLRKMAKLIP PGVPALST
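
The sequences above are listed in space-separated blocks of ten characters, so they have to be joined before any downstream use. The following is a minimal sketch of that step, together with a GC-fraction calculation of the kind that presumably underlies the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGATGGGTG ATCGAACGAA AATACAGGAG GCACTGTTCT ACGAGTTCCG TCTTGAAGAT"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence instead of the sample line should give a joined length equal to the listed gene length, provided the listing is complete.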