Gene Xaut_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_3039 
Symbol 
ID5424127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp3377071 
End bp3378492 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content60% 
IMG OID640882285 
Producttransposase IS4 family protein 
Protein accessionYP_001417926 
Protein GI154246968 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.409496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTA TTGAGGGACT CGCGCGAGAT CAGGTCAACC TGCTTCCTCC TTGTGTTGAT 
GACTATGTTT CCCCGGACGC ATTGGTCCGA GTCGTCGATG CTTTTGTTAC CAGCTTGAAC
TTGGCTGAGC TTGGCTTCGG TCGCGCTATC GCTGCGGTCA CCGGCCGCCC TGGATACCAT
CCAGGCGATA TGCTCCGGCT GTACATCTGG GGCTACCTCA ACCAGGTACG GTCCTCACGC
CAATTGGAAC GAGCGTGTGT CCGCGACCTC GAAGCGCTTT GGCTAATGCG CCGGCTCGCC
CCGGATTACC GAACGATCGC CTCCTTTCGT CATGACAATC CGGAAGCCAT TGTCGGCGCC
AGCGCTGCAT TCATCCAGTT CTGCCGCGAA ACCGGCTTGA TCAGCGGTCG ATTGGTCGCG
CTGGACGGGA CGAAGATGCG CGCGGTCGCG AGCCCAAAGA ACATCGCTGG CGCCGACCGG
CTGGCCCGTG ACGTTGCGCA CACCGAAAAG GAGATCGCCT ACTACCTTGA ACGGCTCGAC
ATCATAGATG AGGCAGTGGC CCAGGGGTTC GACGATCAGC CCAAACATCG GGAGGCGTTC
ACCACTGCGA TGGAGACCCT CGGGCGCCGC AAAGACAGGC TCGTGCGCCG GCAGGACATC
CTGAAGGATC GCGACGAGAC GGCTTTGGTC TTTGGCGAGT CCGACGCGCG GCCGATGGGC
TATGGACGTT CTCCCAAGAC ACCCTGCTAC AACATGCAAA GCGTGGTCGA TGTAGATAGC
GGCCTGATCA TACATCACGA CGTGACCAAC GAGGCAAACG ACAGCCAGCT CCTGCATCCA
ATGTCGATGG CGACGATGGA GGTGCTTGAG GTTGACGAGC TCAAAGTCCT GGCCGACGGC
GGTTACTCCA ACGCCCAGGC GGTCGCGCAA TGCGAGCGCG ACCATATTGA GGTCGCGGCG
CCGATCAAAC GCGGCGCCAT GAGCACCGAC TTTTTCCGGC CAGCGCAGTT CGTGTATGAT
GAGGAGACCG ACACAATCCG GTGCCCCGCC GGCAAGACGT TGAGACCATC CGGCAAACAT
ACCCGCAACC GTGCGATCCG ATATAGAACG CCCGCATGCA AAGACTGTCG GCTGAAGAGC
CGATGCACGT CCGGCGCCCA ACGGACCATC CATCGGTTGT TCGATCAGGC GGCGCTGGAT
CGTATGGAGG CCAGAATCTA CGCGGATCCG AGCTTGATGG TGACCCGCCG ATGTACTGTA
GAGCACCCCT TCGGCACGAT TAAACGGATG TCCGGCGGCG GAAGGTTCCT CACGCGAGGT
CTCAGAGCGG TAAAGGCCGA AGCGGCTCTC TCGATTGTCG CCTTCAACAT CCTCCATGCA
GTAAATGCCT TCGGTGCCGA GCGACTGACG CCAGCGGGGT GA
 
Protein sequence
MSFIEGLARD QVNLLPPCVD DYVSPDALVR VVDAFVTSLN LAELGFGRAI AAVTGRPGYH 
PGDMLRLYIW GYLNQVRSSR QLERACVRDL EALWLMRRLA PDYRTIASFR HDNPEAIVGA
SAAFIQFCRE TGLISGRLVA LDGTKMRAVA SPKNIAGADR LARDVAHTEK EIAYYLERLD
IIDEAVAQGF DDQPKHREAF TTAMETLGRR KDRLVRRQDI LKDRDETALV FGESDARPMG
YGRSPKTPCY NMQSVVDVDS GLIIHHDVTN EANDSQLLHP MSMATMEVLE VDELKVLADG
GYSNAQAVAQ CERDHIEVAA PIKRGAMSTD FFRPAQFVYD EETDTIRCPA GKTLRPSGKH
TRNRAIRYRT PACKDCRLKS RCTSGAQRTI HRLFDQAALD RMEARIYADP SLMVTRRCTV
EHPFGTIKRM SGGGRFLTRG LRAVKAEAAL SIVAFNILHA VNAFGAERLT PAG