Gene Sama_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_3041 
Symbol 
ID4605288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp3617593 
End bp3619863 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content57% 
IMG OID639782457 
ProductDNA topoisomerase IV subunit A 
Protein accessionYP_928913 
Protein GI119776173 
COG category[L] Replication, recombination and repair 
COG ID[COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 
TIGRFAM ID[TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACG CCATTGACTT AAGCCTGGAT GGTGTGGAGC AAATGCCCTT GAGGCGCTTC 
ACCGAAGAGG CCTATCTCAA CTATTCCATG TACGTAATCA TGGACCGGGC CCTTCCCCAT
ATTGGTGATG GCCTCAAACC GGTGCAAAGA CGTATCGTCT ATGCCATGAG TGAACTGGGC
CTTAATGCCC AGTCGAAGCA CAAGAAATCT GCCCGTACGG TGGGTGACGT GCTGGGTAAA
TATCACCCCC ACGGTGACAG TGCCTGTTAC GAAGCCATGG TGCTGATGGC GCAGCCGTTT
TCCTACCGCT ATCCGCTGGT GGATGGTCAG GGTAACTGGG GTGCTCCTGA CGATCCTAAG
TCCTTCGCGG CCATGCGATA TACAGAAGCG CGGCTGTCGC GCTTCTCTGA GGTACTGCTC
TCTGAGCTCG GCCAGGGCAC TGTGGATTGG GGCGTCAACT TTGACGGCAC CATGAAAGAG
CCCAAGGTGT TGCCGGCCCG CTTGCCCCAC ATTCTGCTTA ACGGGGTAAC CGGTATTGCT
GTGGGTATGG CGACGGATAT ACCGCCCCAC AACGCCCGGG AACTGACGCG AGCCTGTATC
GAGTTAATTG ATAATCCCGC CACTGAGCTT AAGCGCTTGA TGGAGCTCGT ACCGGGTCCT
GATTATCCCA CAGAGGCGGA AATCATTACC CCCGCCGATG AAATCGCCAA GATTTATGAG
TCCGGCCGTG GCTCCATCAG GATGCGCGCT GTTTACGAGA TGGAAAACGG TGAGGTGGTG
ATTTCTGCGC TGCCGCATCA GGCCAGCTCC AGTAAGATCC TCGAGCAAAT TGCGGCTCAG
ATGCAGGCGA AAAAGCTGCC GATGGTGACG GATTTGCGCG ACGAATCCGA CCACGAAAAT
CCGGTGCGAC TGGTGTTGGT ACCACGTTCT AATCGGGTGG ATGTGGAACA GCTGATGGCC
CACCTGTTTG CGACCACCGA GCTTGAAAAG AGCTACCGGG TTAACCTGAA TGTGATTGGC
CTGGATGGCC GTCCCAAGGT AAAAGGCCTG AAGGACATGC TGGCCGAGTG GCTGGTTTTC
CGTACCGAAA CTGTACGTCG CCGTCTGCAA TTCCGTCTGG ACAAGGTGCT GGCAAGGCTG
CACATCCTCG AAGCCCTGAT GATTGCCTTC CTTAACATCG ATGAAGTGAT TGAAATCATT
CGCTTCCACG ATGAGCCCAA GGCTGAGCTG ATGGCGCGTT TCGGGCTGTC GGACCGTCAG
GCCGAGGCCA TTTTGGAACT CAAGTTGCGC CATTTGGCCA AGCTCGAAGA GATGAAAATC
AAAGCCGAGC AAAGCGAGCT TGAAGAAGAG CGCAATAAGC TGGAGCAGAT CCTGGGGTCG
GAGCGTCGTC TCAAGACCCT GATCAAGAAA GAACTGGAAA AGGATGCCGA AACCTACGGT
GACGATCGCC GCTCGCCTCT GGTCGTGCGT GGCGAGTCCC GCGCCCTTAC CGAACAGGAG
TTGCTGCCCA CCGAGCCTGT GACAGTTGTG CTTTCTGAAA AGGGTTGGGT ACGCTGTGCA
AAGGGCCACG ATATAGACGC AGGTGCTCTG TCTTATAAAG CGGGCGATGG CTTCCTGTGT
GCGGCGCAAG GGCGCAGCAA CCAGCAGAGT GTGTTTATCG ACAGCTCTGG CCGAGCCTTC
TCAACCGATA CCCACACCCT GCCGTCGGCG AGAAGTCAGG GCGAGCCCAT TACCACTCGC
TTTAATCTGG CGCCTGGCGA AACCATGGAG CATGTGCTCC TGGGTGAAGA GCAGCACCAC
TACCTGCTCG CCAGTGACGC AGGTTATGGT TTTGTCTGCT CTTTTGCCGA TATGATTTCC
CGTAACAAGG CGGGTAAGGC GCTGCTCAGT TTGCCATCCG GCGCCAAGGC GTTGGCCCCC
CGTCGGGTTG ACAAAGACGC CAATCCCTCC ATTTTGGCCA TCACCAATGA AGGCCGCATG
CTGGTCTTTG GACTGGATGC GCTGCCGCAG CTGGCGAAAG GTAAGGGCAA TAAGATTATC
GGTATCCCCA GTGAACGGGC CAAAAACCGA GAAGAGCTGT TGATACACCT GCACCTGGTG
CCAGCCGACT CGGCCGTGAC CCTGTGGGCT GGCAAACGTA AGCTGACGCT GAAGAGTTCC
GATCTCGAGC ATTACCGGGG TGAGCGTGGC CGACGCGGTG CCAAGCTGCC AAGGGGCCTG
CAGCGGGTTG ACAGTGTCGA GCTGGGGGAA GGGGATACCC CCGCCTTGTA A
 
Protein sequence
MSDAIDLSLD GVEQMPLRRF TEEAYLNYSM YVIMDRALPH IGDGLKPVQR RIVYAMSELG 
LNAQSKHKKS ARTVGDVLGK YHPHGDSACY EAMVLMAQPF SYRYPLVDGQ GNWGAPDDPK
SFAAMRYTEA RLSRFSEVLL SELGQGTVDW GVNFDGTMKE PKVLPARLPH ILLNGVTGIA
VGMATDIPPH NARELTRACI ELIDNPATEL KRLMELVPGP DYPTEAEIIT PADEIAKIYE
SGRGSIRMRA VYEMENGEVV ISALPHQASS SKILEQIAAQ MQAKKLPMVT DLRDESDHEN
PVRLVLVPRS NRVDVEQLMA HLFATTELEK SYRVNLNVIG LDGRPKVKGL KDMLAEWLVF
RTETVRRRLQ FRLDKVLARL HILEALMIAF LNIDEVIEII RFHDEPKAEL MARFGLSDRQ
AEAILELKLR HLAKLEEMKI KAEQSELEEE RNKLEQILGS ERRLKTLIKK ELEKDAETYG
DDRRSPLVVR GESRALTEQE LLPTEPVTVV LSEKGWVRCA KGHDIDAGAL SYKAGDGFLC
AAQGRSNQQS VFIDSSGRAF STDTHTLPSA RSQGEPITTR FNLAPGETME HVLLGEEQHH
YLLASDAGYG FVCSFADMIS RNKAGKALLS LPSGAKALAP RRVDKDANPS ILAITNEGRM
LVFGLDALPQ LAKGKGNKII GIPSERAKNR EELLIHLHLV PADSAVTLWA GKRKLTLKSS
DLEHYRGERG RRGAKLPRGL QRVDSVELGE GDTPAL