Gene Gdia_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3052 
Symbol 
ID6976486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3341802 
End bp3343145 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content62% 
IMG OID643392560 
Producttransposase IS1182 family protein 
Protein accessionYP_002277397 
Protein GI209545168 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000434143 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.286719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCT TCATTCCGTT TGATCGATCG CAGCCGTATC TTCTGCCGCC GGATCTGAAG 
TCATGGCTGC CGTCGGACGA TGTGGCGCAT TTCATCGTGG CGGCGGTAGA GCGGGTGCCG
TTGAGGGCAT TTTCCGTTCC TGTGCGGACT GGCGGCAAGG CGCAGTATCA TCCGCGCCTG
ATGCTGGCGC TGCTGATTTA CGCCTATGCG AACGGCGTGT TCTCATCGCG TCGGATCGAA
CGGGCGACAT ATCGTGATAT CGGTATGCGC TTTGTGGCGG CGAACCTGCA TCCTGACCAT
GACACGATCG CGACGTTCCG GCGCGGCAAC CGCACGGCGA TCGAGGCAGC GTTCATGCAT
GTACTCCTGC TGGCACGCGA GACGGGACTG GTGCGGCTTG GCACGGTGTC GATCGACGGC
ACGAAGATCG ATGCCAATGC CTCGAAATAC CGTTCCATTC GTTATGATCG CGCGAAAGAG
CTGCGCGAGA AACTGGCCAC CGATATCTCC ACCCTGATGG AACGGGCAGA GGCGGCGGAT
ACAACCGATG TGGATCATCA GGCGTTGCCG GAGGAACTGG CCCGGCGGGA GGCTCTGAAG
GCAAAGCTGG ATGAAGCCTG TGCGCGACTG GAGGCGGAAG CCCGCGAGCA GGCCAAGACC
GCCCGACCAG AATATGAGCG CAAGAAGGCA GCTTTTGATG CGAAGCGGGG ACGGCGCGGT
CGGCCGCCGA AAGAACCGGA CGATGAACCG CCACCAGACC GGCAGATCAA CCTGACCGAT
CCGGACAGCA AGCTGATGCG CCGCTCCGAC GCGCATGAAT ACCGGCAAGC CTACAATGCC
CAGGCCGTGG TTTGTGCCGA GGGCAGCCAG TTGATCTTGG AAAATGGCGT CGTTGCGACG
ACGGCGGACG CGCCCAGCTT CGCCGCCACC ATCCTGGGTA TGGAGGAGAG GATCGGCCTG
CCACGAACCG TCCTCGCCGA CACGGGTTTC GCCAGCGGCA AAGCCGTCGA AACGTTGCAG
GCCAGCGGCG TGGACCCGCT GGTCGCCATC GGACGCCCTG TGAATCGGCG CCCTTATGAC
TTCCGGCCAG AACCGCCACC CAGGGAGCCG CGCCGGATCA CCGAGCCCTG GCGCCTGGAA
ATGAAGGCCA GGCTGCAACA GAACCCGGCA AAAGCCCTTT ACGCCTTACG CAAGCAGACC
GTCGAACCGG TCTTCGGTAT CATCAAGAGC GCCATGGGCT TCACCCGTTT CCATCTCCGT
GGCCTCCCCA ACGTCGCAAC AGAATGGACG CTCGTCGCCC TCGCATATAA TTGCCGTAGG
ATCACGCGAC TGACGGCCGC ATAA
 
Protein sequence
MSSFIPFDRS QPYLLPPDLK SWLPSDDVAH FIVAAVERVP LRAFSVPVRT GGKAQYHPRL 
MLALLIYAYA NGVFSSRRIE RATYRDIGMR FVAANLHPDH DTIATFRRGN RTAIEAAFMH
VLLLARETGL VRLGTVSIDG TKIDANASKY RSIRYDRAKE LREKLATDIS TLMERAEAAD
TTDVDHQALP EELARREALK AKLDEACARL EAEAREQAKT ARPEYERKKA AFDAKRGRRG
RPPKEPDDEP PPDRQINLTD PDSKLMRRSD AHEYRQAYNA QAVVCAEGSQ LILENGVVAT
TADAPSFAAT ILGMEERIGL PRTVLADTGF ASGKAVETLQ ASGVDPLVAI GRPVNRRPYD
FRPEPPPREP RRITEPWRLE MKARLQQNPA KALYALRKQT VEPVFGIIKS AMGFTRFHLR
GLPNVATEWT LVALAYNCRR ITRLTAA