Gene Gura_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1668 
Symbol 
ID5164143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1934620 
End bp1935906 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content52% 
IMG OID640549164 
Producttransposase 
Protein accessionYP_001230436 
Protein GI148263730 
COG category[L] Replication, recombination and repair 
COG ID[COG5659] FOG: Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000920779 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACAG AGATCGTTTT CCCCGGCATC GAAGAATATA TGGCTCCCTA TTATGGCTAT 
TTCCATCGGT CAGAGAGCCG TGAACTGGCA GAATGTTACT TGGCCGGCCT GCTCATGGAC
GGTGAGCGCA AGTCAGTTGA ACCCATGTCA GAGAAGGTAA ACGCATCTGA ACGAAGTATG
CAGCGCCTCC TTTCGACTGC CAAATGGGAC GATCAACTTG TTGCTGAGCA ATTCCGCCGT
TCCATGCTTG ACGTCACTTC CGACCCGCAG GGGATCCTGG TTCTTGATGA TACCGGGTTC
CCTAAGAAAG GGTACGACAG TGTATGTGTT GCCCGGCAAT ACTGCGGTGC ATCAGGCAAG
ACTGACAACT GTCAGATTGG CGTAAGCATG ACGTATGTCG GCAGAGATGT CGCCTGGCCA
TATGCCATGG AACTGTTCGT CCCGGAATCC TGGGATCAGC AAAATGATGA TTGCACCGCA
AAGCGTAAAA AGGCTCACAT GCCGGAGTCA GTGCACCATA AGTCAAAATG GCGCATGGCA
CTTGATTTTG TTGACCTGGC CCGAAAAGAC AATGTTCCCC ATCGTGCAGT CCTTGCTGAC
AGCTGGTATG GCAACATTCC GGAGTTTCGC AAGGAGCTTG AGTCCCGCAG TGAAAATTAC
ATCCTGGGAG CTTACTCCAA CACCCCGGTA TTTCTTGAGG AGCCGGTCTT TGAAATTGCG
CCAGTCAAAG AGCATAAGCG AGGGCGTCCA CGAACTCGCC CTAAGGTAGT CTCCACAAAC
CCCGAACCGG TCAAGCTGTC GGTACTGGGC GAAAGCATTG CCGATGATGC ATGGCAACGG
CTAGAATTGA GGCTCAATTC CAAGGACAAG CCACTTGTTG CAGAGGCCGT CTCAATGAGA
GTGTGGCCGG CTCACGGATG GCGGCAGGGC AATCATCATG AACAAGTCTG GCTCCTGATA
GAGCGCCGCC CCCTGAACCT GGGTGGATAC GAGCTTCGCT ATTTCTTCAG CAATATGCCG
CAGCATCTGG CAACGATTGA CCTTGCCCGC CTCTACCATG AACGTTATTG GATAGAGCAT
GGCTATCAAC AGCTAAAGGA AGAGCTTGGC CTTGATCACC ATGAAGGGCG CTCATGGAGC
GGATGGCATC GACATGTGCT CCTGACGTCC CTGGCATATG GCTATCTGAC ACTGTTGCGT
TTGCAGCAAA AAAAACAGAA GAGTGCGACA GCGCGGAGCA ACTGGATTCA GAAAAAATCG
ACACTGGCCA ACGACGCTTT GTTCTGA
 
Protein sequence
MTTEIVFPGI EEYMAPYYGY FHRSESRELA ECYLAGLLMD GERKSVEPMS EKVNASERSM 
QRLLSTAKWD DQLVAEQFRR SMLDVTSDPQ GILVLDDTGF PKKGYDSVCV ARQYCGASGK
TDNCQIGVSM TYVGRDVAWP YAMELFVPES WDQQNDDCTA KRKKAHMPES VHHKSKWRMA
LDFVDLARKD NVPHRAVLAD SWYGNIPEFR KELESRSENY ILGAYSNTPV FLEEPVFEIA
PVKEHKRGRP RTRPKVVSTN PEPVKLSVLG ESIADDAWQR LELRLNSKDK PLVAEAVSMR
VWPAHGWRQG NHHEQVWLLI ERRPLNLGGY ELRYFFSNMP QHLATIDLAR LYHERYWIEH
GYQQLKEELG LDHHEGRSWS GWHRHVLLTS LAYGYLTLLR LQQKKQKSAT ARSNWIQKKS
TLANDALF