Gene Xaut_4894 details


Gene Information

Locus tag: Xaut_4894
Symbol:
ID: 5420545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xanthobacter autotrophicus Py2
Kingdom: Bacteria
Replicon accession: NC_009717
Strand:
Start bp: 109794
End bp: 112793
Gene Length: 3000 bp
Protein Length: 999 aa
Translation table: 11
GC content: 59%
IMG OID: 640873558
Product: transposase Tn3 family protein
Protein accession: YP_001409338
Protein GI: 154243765
COG category: [L] Replication, recombination and repair
COG ID: [COG4644] Transposase and inactivated derivatives, TnpA family
TIGRFAM ID:
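
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 109794
end_bp = 112793

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not
# contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 3000 999
```

Under these assumptions the result agrees with the listed Gene Length (3000 bp) and Protein Length (999 aa).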


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAAAC GGAGGCTTCT CAAGGTCCAA GATCGACAGA GACTTTTCGA TATACCAACC 
GATGAGGACG GCCTCATCCG GCACTATTCG TTGTCGTCGG CTGACAGGCT TGAGATTGGA
CTTTGCAGAC GAGAACACAA TCGGCTCGGA TTTGCCGTTC AGCTCTGCCT GATGCGATAT
CCAGGCAGGG TGTTGGCGAC CGATGAAACT CCGCCTCGCG CAATGCTAGA GTACGTTGCT
GAGCAGATTG GCGCCGACGC TGGAAAGTTT GCGCTCTATG CACGCCGTGA AGAAACGCGG
CGCGATCACA TTGCTCGCTT GATGGTTTAT CTGGCCGCGC GGAGCGCGAC GGGGCAAGAC
CGTAGGGCTG CGCTGTTGGC TGCAATTCAG GCGGCCACGA TGTCCGACGA CGGTGGCGCG
ATAGCGAGTG CTACTGTCGC CATGTTTCGT GAACGCGGAT CTCTTCTGCC AGCAATCGAC
ACGATCGAAC GGATCGGTCT TGCTGCCCGC GCCATTGCCC GTCGGCGGGC AGAGAGAGCG
CTGATCGAAG AAATTTCGGT CGATACGCTT CAATCGTTGG ATAAGCTGTT GGAGGTTGAC
CCGGCCATCG GCCAGACGCG ATTTCACTGG CTGCGATCAG CGCCGGATGC GCCAGGTACG
TCAAACCTGG TCGGGCTGAC CGAACGGATT GCCTTCCTGC GCGAGCTAGA AATCGATCCG
AGATTGCAGA TACGCATATC GTCTGGACGG TGGGATCAGA TGATCCGTGA AGGCAACGCC
ACACCGGCAT GGCTGGCCAA CGACTTCAAT GCCAGCCGTC GACACGCGCT GATCGTGGCG
CAGATTATCA AGCTCGGCCA GAAGCTCACG GACGATGCAG TGTCGATGTT CATCAAGCTG
ATAGGTCGGC TGTTCTCGCA AGCCAATAAC CGCAAGAAGC AGCGGCACAT GGACTGCAGG
CCGGATACCG CCAAAGCGCT ACGCATGTTC CTGGACACGA TCACAGCCCT GCAGTCCGCG
AACGATTATG GCCGGAACGC ATTGGAGGTT CTCGATCAGG AAGTTGGATG GCACCGGTTG
CTTCGGATGA AGCCTGAGCT TGAGTCGATG GTCGACGACA ACGAGGCATC GCCCTTGACC
TTAGCGGTCG AGCAATATGC CACCGTCAAC AAGTATGCCG GTGCGTTTCT GCAAGCGTTC
ACGTTCCGCT CAGCGCGCCG CCACGATCCC CTTCTTGCGG CGATTTTCCT GCTGAAGCGG
CTCTATGCCG AGAAGCGGCG GACCCTTCCG GATCGCGTCC CGGTCACCCA CCTCAGCCAA
GTTGATCGAC GGCTAATCCT CGGGCAGGAG AAGCCCGATC GCCGTCTCTA TGAGATTGCA
ACCCTCGCGG CTTTGCGAGA CCGGCTTAGA TCTGCGGACA TTTGGGTCGA TGGCAGCCGA
TCCTTCCGAC CGATCGACGA GCACCTGATG CCGCGGTCAA CGTTCACCAT CCTGAAAGAT
GAAGATCGCC TCGGACTTGG TGTCCAAGAA GACGGCGCGG CGTGGCTTAC CGAAGCGCGG
CAGATGCTCG ACTTCAACCT GAAGCGCCTG GCGTACAGGG CACGATCCGG GAGGCTCGAA
GGTGTTCGCC TTGAAGCTGG TACCTTGATC GTCACGCCGA CCGCCGGCGA GGTTCCTGCT
GCAGCGGAGG AACTGAACGC CGAGATCAGC GAGCTTTATC CGTTGGTCGA GGTGCCGGAC
CTCCTGCGGG AAGTGCACGA ATGGACCGGC TTTGCGGATT GCTTCACGCA TGTTCGAACG
GGTGACACTC CGAGGAATGT CTCGGCCATG CTGGCTGGCG TACTGGCCGA TGCGACCAAT
CTCGGTCCAA AGCGAATGGC CAGCGCGTCC AAAGGCATCA GCGCTCACCA GATCAGTTGG
ATGCGAGCCT TCCATGCCCG GTCAGAGACC TACCGCGCGG CCCAGGCCTG CGTGACGGAC
GCACACACCC GCCATCCGCA TTCTTGCCTT TGGGGCAATG GCACGACGTC ATCATCCGAT
GGCCAATTCT TCCGAGCAAG CGACCGAGCC GCAAAGCGCG GAGATATCAA TCTACATTAC
GGCAGTGAGC CCGGATCGAA GTTCTACAGC CATCTGTCAG ATCAGTACGG CTACTTCAGC
ATCTTGCCCA TCAGCCCGAC CGAAAGCGAG GCTGCCTATG TGCTCGACGG ACTATTCGAT
CAGGACACAA TCCTCGAAAT ACAGGAGCAC TTCACCGACA CCGGCGGCGC GAGCGATCAC
GTCTTTGGGC TATTCGCTCT GATCGGCAAG CGGTTCGCAC CACGACTGCG CAATCTCAAA
GATCGGAAGT TCCACACGTT CGAGAAAGGC GATGCATACC CGGCGCTGTC GAACCACATC
GGGGCGCCGA TCAACACCAC CCTGATCCTC GATCACTGGG ATGATCTGCT TCATCTCGCG
GCATCGATCA CCACCCGTGC CGTTGTGCCC TCTACGATTT TGAAGAAGCT CTCGGCATCA
CCGAAGGAAA GCCAGCTGGC CAAGGCTCTT CGGGAACTCG GCCGCATCGA GCGGTCGCTC
TTCATGACCG AATGGTACTC GAACTCGACA TTGCGCCGGC GCTGCCAAGC CGGCCTCAAC
AAGGGCGAGG CAGCGCACAA ACTCAAACGC GCAGTCTTCT TCCATGAGCG TGGCGAACTC
CGCGACCGGT CGTTCGAAAG TCAGGCATTC CGCGCATCGG GCCTCAATCT TGTCGTCAGC
GCGATCGTCC ACTGGAACAC GGTCTATCTC GACCGCGCGG TCAAAGAGCT CAAACGAGCG
GGAAGGAACA TTCCAGAGTC CCTGTTGAGG CATATCTCGC CACTGAGTTG GGAGCATATC
AACCTGACAG GCATCTACAC CTGGGACAGC GAGCAACATC TCCCGGAAGG CTTCAGATTG
CTTCGCCTCC CGGCTGGGCT ACGGCGTGCC GCACAACGTT CCTGCTCCGT TCGACCTTAG
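
A GC content figure such as the one listed for this gene is the share of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming whitespace between the 10-base blocks should be ignored. The function name `gc_percent` is hypothetical, and only the first line of the gene sequence is used here, so its result is not expected to equal the value for the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied from this record (60 bases).
first_line = (
    "ATGGGTAAAC GGAGGCTTCT CAAGGTCCAA "
    "GATCGACAGA GACTTTTCGA TATACCAACC"
)
print(round(gc_percent(first_line), 1))
```

To compare against the listed GC content, the complete 3000-base sequence would have to be passed to the function instead of the fragment.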
 
Protein sequence
MGKRRLLKVQ DRQRLFDIPT DEDGLIRHYS LSSADRLEIG LCRREHNRLG FAVQLCLMRY 
PGRVLATDET PPRAMLEYVA EQIGADAGKF ALYARREETR RDHIARLMVY LAARSATGQD
RRAALLAAIQ AATMSDDGGA IASATVAMFR ERGSLLPAID TIERIGLAAR AIARRRAERA
LIEEISVDTL QSLDKLLEVD PAIGQTRFHW LRSAPDAPGT SNLVGLTERI AFLRELEIDP
RLQIRISSGR WDQMIREGNA TPAWLANDFN ASRRHALIVA QIIKLGQKLT DDAVSMFIKL
IGRLFSQANN RKKQRHMDCR PDTAKALRMF LDTITALQSA NDYGRNALEV LDQEVGWHRL
LRMKPELESM VDDNEASPLT LAVEQYATVN KYAGAFLQAF TFRSARRHDP LLAAIFLLKR
LYAEKRRTLP DRVPVTHLSQ VDRRLILGQE KPDRRLYEIA TLAALRDRLR SADIWVDGSR
SFRPIDEHLM PRSTFTILKD EDRLGLGVQE DGAAWLTEAR QMLDFNLKRL AYRARSGRLE
GVRLEAGTLI VTPTAGEVPA AAEELNAEIS ELYPLVEVPD LLREVHEWTG FADCFTHVRT
GDTPRNVSAM LAGVLADATN LGPKRMASAS KGISAHQISW MRAFHARSET YRAAQACVTD
AHTRHPHSCL WGNGTTSSSD GQFFRASDRA AKRGDINLHY GSEPGSKFYS HLSDQYGYFS
ILPISPTESE AAYVLDGLFD QDTILEIQEH FTDTGGASDH VFGLFALIGK RFAPRLRNLK
DRKFHTFEKG DAYPALSNHI GAPINTTLIL DHWDDLLHLA ASITTRAVVP STILKKLSAS
PKESQLAKAL RELGRIERSL FMTEWYSNST LRRRCQAGLN KGEAAHKLKR AVFFHERGEL
RDRSFESQAF RASGLNLVVS AIVHWNTVYL DRAVKELKRA GRNIPESLLR HISPLSWEHI
NLTGIYTWDS EQHLPEGFRL LRLPAGLRRA AQRSCSVRP
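
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, so the sketch uses the standard codon assignments and does not handle alternative start codons (the gene here begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last lines of the gene sequence are translated.

```python
# Codon table in the conventional T, C, A, G ordering;
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
gene_start = "ATGGGTAAAC GGAGGCTTCT CAAGGTCCAA GATCGACAGA GACTTTTCGA TATACCAACC"
gene_end = "CTTCGCCTCC CGGCTGGGCT ACGGCGTGCC GCACAACGTT CCTGCTCCGT TCGACCTTAG"

print(translate(gene_start))  # prints: MGKRRLLKVQDRQRLFDIPT
print(translate(gene_end))    # prints: LRLPAGLRRAAQRSCSVRP
```

The two outputs match the first 20 and last 19 residues of the protein sequence in this record. The last line is in frame because each preceding line holds 60 bases.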