Gene Oant_4669 details


Gene Information

Locus tag: Oant_4669
Symbol:
ID: 5383258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ochrobactrum anthropi ATCC 49188
Kingdom: Bacteria
Replicon accession: NC_009671
Strand:
Start bp: 31424
End bp: 34390
Gene Length: 2967 bp
Protein Length: 988 aa
Translation table: 11
GC content: 62%
IMG OID: 640837428
Product: transposase Tn3 family protein
Protein accession: YP_001373268
Protein GI: 153012057
COG category: [L] Replication, recombination and repair
COG ID: [COG4644] Transposase and inactivated derivatives, TnpA family
TIGRFAM ID:
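
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not part of the protein length. The variable names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table.
start_bp = 31424
end_bp = 34390
gene_length_bp = 2967
protein_length_aa = 988

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.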


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAAGC ACGAACTGCT GAACGAAGCC GAGCGGGAAC AGTTGCTGGG TTTTCCGACG 
AGCCGAGACG ATCTTGCCCG ACTTTACACC TTCGAACCGC ACGACCTCGA CCTGATCCGG
CTTCGCCGAG AGAATCGCAA CCGTCTCGGT GTTGCTGTTC AACTTGCGCT GTTTCGCCAT
CCCGGCATGA CATTGGCACA AATTCTTCAA CGCAGCGCCG GGCTTTCCGA AGAACTCGTG
TCCTTCATTG CCGAGCAGCT CGACCTGCCG GAAACTGCCT TCGCCGCATA TGCCGTCCGC
GACCAGACAA TGACCGATCA TGCCCGCGAG CTCGCCTCGG CACTCGGCCT TCGAGGCGCA
AGCCGGACCG ATATCCCCTT CATGATCGAG GCGGCCGCCA AAGCTGCCTG GGGAACGGAC
AAGGGTGTCG TGATCGCCGC CGGTATCATT GACGCGCTGC GCCAGGCCAA GATTCTTCTG
CCTGCTACCT CGACCATAGA GCGCGCTGGC ATTGCCGGCC GGGCGCGTGC CAGAAAACAG
GCGGCTCATG CGCTCCTGTC CGGTCTTCGG CCGGCTCAAC TGGACGCGCT CGATGCGCTT
CTGGCTGCAG ATTCGACCGG CGCTATGCCG CTCACCTGGC TCAAGACGAT CCCGGTGGCT
GCAAAGCCCG ATCACGTTCG AGGCATTCTC GATCGTCTGG GCGTGGTGCG GAGGATCGGC
ATCCCGCCGA AATTAAGCGC CGCCATCCAC CCCGCCCGCT ACCGGCAGTT CGTGCGTGAG
GGGCGCGTGT CCCCCGCCTA TATGATCGAG CGTTACACGA CGTCGCGCCG GCGGGCGACG
CTCGTTGCAT TTCTGATCGA CCTCGAAGAG CGGCTTACCG ACGCGGCCAT CGAAATGGCA
GACAAGCTGA TCGGCGGTGC GTTCAGCCGT GCCCAGAACA AGCAGGCCCG CCGCTATGGG
GCGACCGCGA AAGACGTCGC CCGGCTGATG CGACTGTTTC GCGGTACGAT CGACGCTCTT
GCCGGTGCTA TCGACAACCA CTCGGACCCG GTCGAGGCAA TTGACGAAAC TGTCGGCTGG
ATCAATCTGC TCAAAGCCAG ACATGAGATT GCCGAGCTCG CCGAAACTGC CGATGTCGAT
CCCCTCACGG TGGCCGTGGA CCGCTATGCG ACATTGCGGA AGTTTGCTCC GGCGCTGATC
GAAGTTCTCG AATTCAAGGC CAACCGAGGA AGCACGCGAA CGATAGCCGG CGTTCAAAAG
CTTCGCGAAC TCAACAGGTC GGGCAGGCGC GATGTGCCGC CGGATGCGCC GATGCCGTTC
AAGGAGGAAT GGCGGAAGCT GGTGATCGAG CCGGATGGCA AGATCAATCG TCGGCTCTAC
GAAACGGCTA TGCTGGCGCA CTTGCGCAAC AAGTTGCGTT CCGGGGACGT CTGGGTTGAG
CGGTCGTCAG CCTATCGCCG CTTCGACAGT TACCTTCTGC CGGAACCGGC CGCCGCTCCA
ATCGTCGCCG AACTCGGATT GCCGACCGCC GCCGACACAT GGCTCGAAAA GCGCGGCCGG
GAACTGGACT GGCGGCTGAA GAAATTCGCG CAGCGCCTCA AACGCAACCA GCTTGAAGGC
GTCCGCTTTG CCGAAGATCG GTTGCAGGTC TTACCCGTCA AGACCGCAGT TCCCGACGAA
GCCGAAGCAC TCGCCGATCG CCTGGACGCA ATGATGCCGC GCATCCGCAT CACCGAACTG
CTGCATGAAG TGGCGCGCGA GACCGGATTC ATGGCGGCCT TCACCAATCT GCGCACCGGC
GAAAACTGTC CGAACGAGAG CGCGCTGCTG GCCGCAATTC TGGCCGATGC GACCAATCTC
GGGCTCTCCC GCATGGCGGC CGCCAGCCAC GGCGTCACCC GAGACCAGCT CATCTGGACA
CAAGACGCCT ACATTCGTGA GGATACCTAC CGGGAAGCGC TCGCTACCAT CATCAATGCT
CAACACCGCC TGCCTATCGC TTGGGTCTGG GGTGACGGAA CCACGTCAAG TTCTGACGGG
CAGTTCTTTC GTGGCGGCAA ACGTGGCACG GCAGGAGGAG ACATCAACGC CCGCTACGGC
GTCGATCCGG GCTTCAGCTT CTACACCCAC GTCTCCAATC AACACGGCCC TTTCCACATC
AAGGTTCTCT CGGCGGCGAC GCACGAGGCG CCCTATGTGC TCGACGGGCT ACTTCACCAC
GGCACCAATC TCTCGATTGC GGAGCATTAT ACCGACACTG GCGGCGCGAC CGATCATGTG
TTCGCGCTCT GCGCCATGCT TGGGTTCCGT TTCTGCCCGC GGCTACGCGA CTTCCCGGAT
CGCCGACTGA TCCCGATCGA GCATCCCGCG GGCTATCCCG AAATCGCGCC GCTCCTCGGC
AAACGCATCC GCACCGATGT CATCCGCGAA CATTGGGATG ACGTGATGCG TCTGGTCGCT
TCGCTCAAGA CCGGCCATGT CGCGCCATCG GTGATGTTGC GGAAGCTCTC GGCCTATGAG
CGTCAGAACA AGCTGGACAT AGCGCTTCAG GAGATCGGCA AAATCGAGCG AACCCTGTTC
ATGCTCGACT GGCTGGAAAG TCCGGGGCTG CGACGTAGAT GCCACGCCGG TCTCAACAAG
GGCGAGCAGC GCCATGCCCT CGCGCAAGCG ATCTACACCT TCCGACAGGG ACGCATCATC
GACCGCAGTC ACGAGGCCCA ACAGTACCGG GCCTCCGGCT TGAACCTCGT CATTGCGGCG
ATCGTCTATT GGAATTCCAC CTATATGCGT GACGCCATTG AGCATCTGCG CTCGCAGGGG
GAAGCTGTTT CCGACAATCT CCTGGCTCAT ACCTCCCCGG TCGGGTGGGA GCACATCGCC
TTTTCCGGTG ACTTCCTCTG GGATCGCGCC GCAAAGACGA CCGGCCGGAA ACCGCTCAAC
TTATCTACAA AACAGCGAGT GGCGTGA
 
Protein sequence
MRKHELLNEA EREQLLGFPT SRDDLARLYT FEPHDLDLIR LRRENRNRLG VAVQLALFRH 
PGMTLAQILQ RSAGLSEELV SFIAEQLDLP ETAFAAYAVR DQTMTDHARE LASALGLRGA
SRTDIPFMIE AAAKAAWGTD KGVVIAAGII DALRQAKILL PATSTIERAG IAGRARARKQ
AAHALLSGLR PAQLDALDAL LAADSTGAMP LTWLKTIPVA AKPDHVRGIL DRLGVVRRIG
IPPKLSAAIH PARYRQFVRE GRVSPAYMIE RYTTSRRRAT LVAFLIDLEE RLTDAAIEMA
DKLIGGAFSR AQNKQARRYG ATAKDVARLM RLFRGTIDAL AGAIDNHSDP VEAIDETVGW
INLLKARHEI AELAETADVD PLTVAVDRYA TLRKFAPALI EVLEFKANRG STRTIAGVQK
LRELNRSGRR DVPPDAPMPF KEEWRKLVIE PDGKINRRLY ETAMLAHLRN KLRSGDVWVE
RSSAYRRFDS YLLPEPAAAP IVAELGLPTA ADTWLEKRGR ELDWRLKKFA QRLKRNQLEG
VRFAEDRLQV LPVKTAVPDE AEALADRLDA MMPRIRITEL LHEVARETGF MAAFTNLRTG
ENCPNESALL AAILADATNL GLSRMAAASH GVTRDQLIWT QDAYIREDTY REALATIINA
QHRLPIAWVW GDGTTSSSDG QFFRGGKRGT AGGDINARYG VDPGFSFYTH VSNQHGPFHI
KVLSAATHEA PYVLDGLLHH GTNLSIAEHY TDTGGATDHV FALCAMLGFR FCPRLRDFPD
RRLIPIEHPA GYPEIAPLLG KRIRTDVIRE HWDDVMRLVA SLKTGHVAPS VMLRKLSAYE
RQNKLDIALQ EIGKIERTLF MLDWLESPGL RRRCHAGLNK GEQRHALAQA IYTFRQGRII
DRSHEAQQYR ASGLNLVIAA IVYWNSTYMR DAIEHLRSQG EAVSDNLLAH TSPVGWEHIA
FSGDFLWDRA AKTTGRKPLN LSTKQRVA