Gene Gdia_0171 details


Gene Information

Locus tag: Gdia_0171
Symbol:
ID: 6973563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 185312
End bp: 186655
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 62%
IMG OID: 643389704
Product: transposase IS1182 family protein
Protein accession: YP_002274585
Protein GI: 209542356
COG category: [L] Replication, recombination and repair
COG ID: [COG3666] Transposase and inactivated derivatives
TIGRFAM ID:
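
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue. The variable names are not part of the record.

```python
# Values copied from the record above.
start_bp = 185312
end_bp = 186655
gene_length_bp = 1344
protein_length_aa = 447

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1344 448 447
```

Under those assumptions the computed span and residue count agree with the Gene Length and Protein Length fields.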


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0154733
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCT TCATTCCGTT TGATCGATCG CAGCCGTATC TTCTGCCGCC GGATCTGAAG 
TCATGGCTGC CGTCGGACGA TGTGGCGCAT TTCATCGTGG CGGCGGTAGA GCGGGTGCCG
TTGAGGGCAT TTTCCGTTCC TGTGCGGACT GGCGGCAAGG CGCAGTATCA TCCGCGCCTG
ATGCTGGCGC TGCTGATTTA CGCCTATGCG AACGGCGTGT TCTCATCGCG TCGGATCGAA
CGGGCGACAT ATCGTGATAT CGGTATGCGC TTTGTGGCGG CGAACCTGCA TCCTGACCAT
GACACGATCG CGACGTTCCG GCGCGGCAAC CGCACGGCGA TCGAGGCAGC GTTCATGCAT
GTACTCCTGC TGGCACGCGA GACGGGACTG GTGCGGCTTG GCACGGTGTC GATCGACGGC
ACGAAGATCG ATGCCAATGC CTCGAAATAC CGTTCCATTC GTTATGATCG CGCGAAAGAG
CTGCGCGAGA AACTGGCCAC CGATATCTCC ACCCTGATGG AACGGGCAGA GGCGGCGGAT
ACAACCGATG TGGATCATCA GGCGTTGCCG GAGGAACTGG CCCGGCGGGA GGCTCTGAAG
GCAAAGCTGG ATGAAGCCTG TGCGCGACTG GAGGCGGAAG CCCGCGAGCA GGCCAAGACC
GCCCGACCAG AATATGAGCG CAAGAAGGCA GCTTTTGATG CGAAGCGGGG ACGGCGCGGT
CGGCCGCCGA AAGAACCGGA CGATGAACCG CCACCAGACC GGCAGATCAA CCTGACCGAT
CCGGACAGCA AGCTGATGCG CCGCTCCGAC GCGCATGAAT ACCGGCAAGC CTACAATGCC
CAGGCCGTGG TTTGTGCCGA GGGCAGCCAG TTGATCTTGG AAAATGGCGT CGTTGCGACG
ACGGCGGACG CGCCCAGCTT CGCCGCCACC ATCCTGGGTA TGGAGGAGAG GATCGGCCTG
CCACGAACCG TCCTCGCCGA CACGGGTTTC GCCAGCGGCA AAGCCGTCGA AACGTTGCAG
GCCAGCGGCG TGGACCCGCT GGTCGCCATC GGACGCCCTG TGAATCGGCG CCCTTATGAC
TTCCGGCCAG AACCGCCACC CAGGGAGCCG CGCCGGATCA CCGAGCCCTG GCGCCTGGAA
ATGAAGGCCA GGCTGCAACA GAACCCGGCA AAAGCCCTTT ACGCCTTACG CAAGCAGACC
GTCGAACCGG TCTTCGGTAT CATCAAGAGC GCCATGGGCT TCACCCGTTT CCATCTCCGT
GGCCTCCCCA ACGTCGCAAC AGAATGGACG CTCGTCGCCC TCGCATATAA TTGCCGTAGG
ATCACGCGAC TGACGGCCGC ATAA
 
Protein sequence
MSSFIPFDRS QPYLLPPDLK SWLPSDDVAH FIVAAVERVP LRAFSVPVRT GGKAQYHPRL 
MLALLIYAYA NGVFSSRRIE RATYRDIGMR FVAANLHPDH DTIATFRRGN RTAIEAAFMH
VLLLARETGL VRLGTVSIDG TKIDANASKY RSIRYDRAKE LREKLATDIS TLMERAEAAD
TTDVDHQALP EELARREALK AKLDEACARL EAEAREQAKT ARPEYERKKA AFDAKRGRRG
RPPKEPDDEP PPDRQINLTD PDSKLMRRSD AHEYRQAYNA QAVVCAEGSQ LILENGVVAT
TADAPSFAAT ILGMEERIGL PRTVLADTGF ASGKAVETLQ ASGVDPLVAI GRPVNRRPYD
FRPEPPPREP RRITEPWRLE MKARLQQNPA KALYALRKQT VEPVFGIIKS AMGFTRFHLR
GLPNVATEWT LVALAYNCRR ITRLTAA
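
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_fraction` and `translate` are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares for sense codons (alternative start codons are not handled here). It is demonstrated on the first printed line of the gene sequence only.

```python
# Build a codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First printed line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAGCAGCT TCATTCCGTT TGATCGATCG CAGCCGTATC TTCTGCCGCC GGATCTGAAG"
print(translate(first_line))  # MSSFIPFDRSQPYLLPPDLK
print(gc_fraction(first_line))
```

The translated fragment matches the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field of the record.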