Gene Gdia_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2112 
Symbol 
ID6975539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2340061 
End bp2341038 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content67% 
IMG OID643391641 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_002276486 
Protein GI209544257 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.838809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGA TCGCCTCCCG TGCCAAGCCT GAAATGCGCG ATACCCCGAT CACGATCGGG 
ATCGACGTCT CGAAAGATCA CCTTGATGCC GCGCTTCATC CGGGCTCCGC GACGCGGCGG
TTCGATAATG ACGCCGCCGG TCATACGGCC CTGCGCCGCT GGATCGGCAA ACGGGCGGTG
GCCCGTATCG TGTTCGAAGC AACGGGGATC TATCACCGTG CCCTTGAGCG CCGCCTGACC
CAGGCGGGCC TGCCCGTCTG CAAGGTCAAC CCGCGCCAGG CCCGGCGCTT TGCCGAGGCG
ACTGGCACGC TGGCCAAGAC CGATCGTGTG GATGCGCTCA TGCTGGCCCG TTTCGGCGTG
GCGCTGGAGC CCGCGATACG CCAGGCCCCC AGTGAAAAGC AGGCGGAACT GGCCGAACTC
GTCGCCGCCC GCGAGGGGCT TTCGCGTGAC CGGACGAGAA CCCTGAACCG CAAGGCCGCC
ACGGAGAACG GCCTGCTCCT GCGTCAGACC CGCCATCGCC TGCGTCAGAT CGAAGCGCAG
ATCGCGGCCA TCGACAAGGC GGTCGCGGCC CTGATCGACG CCGAACCCGT AATGGCGCAC
CGGCGCGATA TCCTGCTCAG CATTCCCGGC GTGGGCGCCA CCACAGCCCA TGCCCTGCTG
GCCAACATGC CCGAACTCGG CAGCATGGAG GAGGGGCAGG CCGGCGCGCT GGCCGGCCTC
GCGCCCATTA CGCGCCAGTC GGGAACCTGG CAGGGAAAGA GCGTCATCCG GGGCGGCCGC
GCTCAGCTGC GCCGGGCCCT CTACATGCCG GCACTGGTCG CCATCCGGCA TAATCCCGAT
CTCAAGCGCC AATATGACGC TCTCGTCGCA CGCGGAAAAC CGGCCATGCT CGCCATCACC
GCCGTCATGA GGAAGCTCAT CGTCCTCGCA AACGCACTCC TGCGACAGGA TCGCTCATGG
GCACCAAAAA TAGCTTGA
 
Protein sequence
MIAIASRAKP EMRDTPITIG IDVSKDHLDA ALHPGSATRR FDNDAAGHTA LRRWIGKRAV 
ARIVFEATGI YHRALERRLT QAGLPVCKVN PRQARRFAEA TGTLAKTDRV DALMLARFGV
ALEPAIRQAP SEKQAELAEL VAAREGLSRD RTRTLNRKAA TENGLLLRQT RHRLRQIEAQ
IAAIDKAVAA LIDAEPVMAH RRDILLSIPG VGATTAHALL ANMPELGSME EGQAGALAGL
APITRQSGTW QGKSVIRGGR AQLRRALYMP ALVAIRHNPD LKRQYDALVA RGKPAMLAIT
AVMRKLIVLA NALLRQDRSW APKIA