Gene Sbal223_4399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_4399 
Symbol 
ID7090186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011664 
Strand
Start bp74374 
End bp75504 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content47% 
IMG OID643463269 
Productintegrase family protein 
Protein accessionYP_002360281 
Protein GI217975611 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value0.685705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0426652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGTA TTAAACTGCA ACAGTCAATG ATTGATAATT TACCCACACC ACAGAAAATA 
ACGGTGTTTT ACGATAAATA TTTTCCCGCT TTTTTTATGC GAGCCTATCC CTCTGGTATT
CGGTGTTATT ACGTTCGCTT CCAGTATCAA GGTTTGCGAC AACTTCATGT TATCGGCAAT
GCAAATGATA TCACGCTTGA TGATGCGCGA CTTGAGGCAA GAAAGCACAT AGATGCGTTA
CGTTATGGGG TGGTGCGTGA ACCTGCGCCC TCTCTTACGC TCTGTGCGTT CGCAGAAGAG
TTTTTTCCAA GATATGCCCG TCATTGGAAG CCCTCAACAT TACTCAATAG CCAGCGTGGT
TTTACGCGGC ATATCGCCCC TGTGTTGGGT GATGTTCCGC TTGCCGCGTT GACACGACAA
CACATAGAAC AGTGGTTTGA TGGTATGCAT GCCTCGAAAG GTATGGCTAA TCGCTTGTTG
CCGTTGTTGT CGGTAATGAT GCAGCAAGCC GAAGTATATG AGTATCGTCC AGCGCAAAGT
AACCCTTGCA AAGGCTTTAA GCGCTATAAG TCTACGCACA GTGAGCGTTA TTTATCTGAG
GAAGAACTCA AGCGGCTATG GTTGGCTTTA GATAGCCACG AAAAAGCGTC ACCCGTTGCG
GTGATGGTCT TACGGTTACT AATCTTAACG GGTTGTCGCT GTAACGAAGT GTGCTCCGTC
AAATGGGCTG ATTATCGGCA AGGGCATTGG TATTTGCCCG ACAGTAAAAC AGGCGCTAAA
ACAGTGTTTC TATCTTCCTT TGCGCGGGAG TTGCTGAGTG ATTGGCCACA AGTAAGCGAG
CATCTTTTTT GGCATCAATC ACCTTCACAG CCGTTTACCC CTGTTTGCTT AGACCGATTT
TGGCGGCCAT TTCGCGAGAC AATTCATCTT AATGATGTTC GCATCCATGA CTTACGCCAC
ACCTACGCCA GTATTGCAGT AAAGCACAAT ATCAACATCC TCACGATTGG CCGCTTATTA
GGGCATGCCT TACCTGAAAC AACCTTAAAG TACACACACC TTGCTAAACG TGATGTTCAA
CAGGCGGCAA ATGTCGTTTC TCAACTCATT GCAGGGGAGA TGCAACGATG A
 
Protein sequence
MPSIKLQQSM IDNLPTPQKI TVFYDKYFPA FFMRAYPSGI RCYYVRFQYQ GLRQLHVIGN 
ANDITLDDAR LEARKHIDAL RYGVVREPAP SLTLCAFAEE FFPRYARHWK PSTLLNSQRG
FTRHIAPVLG DVPLAALTRQ HIEQWFDGMH ASKGMANRLL PLLSVMMQQA EVYEYRPAQS
NPCKGFKRYK STHSERYLSE EELKRLWLAL DSHEKASPVA VMVLRLLILT GCRCNEVCSV
KWADYRQGHW YLPDSKTGAK TVFLSSFARE LLSDWPQVSE HLFWHQSPSQ PFTPVCLDRF
WRPFRETIHL NDVRIHDLRH TYASIAVKHN INILTIGRLL GHALPETTLK YTHLAKRDVQ
QAANVVSQLI AGEMQR