Gene Sama_1026 details


Gene Information

Locus tag: Sama_1026
Symbol:
ID: 4603278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 1241013
End bp: 1242677
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 55%
IMG OID: 639780365
Product: recombination and repair protein
Protein accession: YP_926903
Protein GI: 119774163
COG category: [L] Replication, recombination and repair
COG ID: [COG0497] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00634] DNA repair protein RecN
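
The coordinate and length fields above are consistent with one another. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (neither is stated in the record; the variable names are illustrative), the listed lengths can be rederived like this:

```python
# Values copied from the Gene Information fields above.
start_bp = 1241013
end_bp = 1242677

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which does not encode a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1665
print(protein_length)  # 554
```

Both results match the Gene Length (1665 bp) and Protein Length (554 aa) fields.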


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTGTC AGCTCAGCAT CAATAACTTT GCCATTGTTC GCTTTCTGGA ATTGGACTTC 
CGTCCGGGTA TGACCAGTAT TACCGGTGAA ACCGGTGCCG GTAAGTCTAT CGCCATCGAT
GCCCTGGGGC TGTGCCTGGG TAACCGCGCC GACGCCGGTG CCGTGCGCCC CGGAGCCAGC
AAAACCGAGG TCAGCGCCCG TTTCTCTCTG GATGATGTGC CACTGGCCCG GCGCTGGCTG
GAGGACAACG ATCTCGAGCT GGACGACGAA TGTATTCTCA GGCGTACCAT CACTTCCGAT
GGCCGTTCAC GGGCCTACAT CAATGGCAAT CCTGTCCCCC TCTCTCAGCT CAAATCCCTC
GGGCAATATC TGGTGGGGAT CCACGGCCAG CATGCACACC ACGCTCTCTT AAAAAACGAA
CACCAACTGA CGCTGCTGGA TAGCTATGCC AATCACCGCA TGCTGCAGGA GTCGGTCGCC
AGCGCCTATC AGCGCTGTCG TCAGATCGAA GCCGAATTAA GGCAGCTTGA GCAAAGCCAG
CAGGAACGCA TCGCCCGCAA ACAGCTGCTG CAGTATCAGG TTGAAGAATT GGACGAATTC
GCCCTGATGG AAGGAGAGTT TGAACAAATC GAGGCCGAAC ACAAACGCCT CGCCAATGGC
ACCGCACTGA TTGAAGATTG TCAGCACACC TTGTCGCTGC TCAGTGAATC CGACGAAGGC
AATATTGAGT CGCTGTTAAA CCGAGCCTTA TCCCTGGCCG CAACCCTGGA GTCGCTTGAC
CCTACCCTGG CCCCCATTGG GAATATGCTC AATGACGCCC TTATTCAGGT GCAGGAAAGC
AGCAGCGAAC TTGGCCGTTA CCTGTCCAAT CTGGAGCTGG ACCCGGAACA TTTTGCCTAT
CTCGAAGAGC GACTCTCCAA AGCCATGACC CTGGCACGAA AACACCATGT TGCCGCCGAT
AAACTGGCTG CGCATCATCA GGAGCTTGCC AGGGAACTGG CCGGATTGGA TTCCGATGAG
GCGCAGCTGG ATGAACTGAA GCAACAGGTG CAAACCAGCA GGGAGGCTTA TCTCAGTCAC
GCGCAGAAGC TCAGTCAAAG CCGTCTTCGC TATGCCAAAG AACTGGATAA GCAGGTCACC
GCCTCCATTC AGGAGCTCAA CATGCCAAAA GGTAAGTTCA CCATCGAGGT GAACTTCAAC
GACAAGGTGA TGAGTGCAAA CGGCAGTGAC AGTGTGGAGT TTTTGGTTAC CACCAACCCA
GGCCAGCCCC TGTCGCCATT GGCCAAGGTT GCCTCGGGCG GTGAACTGAG CCGCATTGGA
CTGGGTATTC AGGTGATCAC CGCGAAAAAG GTCTCGACAC CCACACTGAT TTTTGACGAA
GTGGATGTGG GGATTTCCGG CCCAACTGCG GCCGTGGTTG GCCGTATGCT CCGTGCCCTC
GGTGAGGCAA CTCAGGTGTT TTGTGTCACC CACTTGCCCC AGGTTGCCGG TAATGGCCAT
CAACATATGT TTGTGAACAA ATCCACTAAG GGTGGACAAA CAGAAACCTC GATGAAGCCA
CTGGACAAAG ACGCCCGGAT ACAGGAACTG GCTCGCTTGC TGGGCGGCGA TACCATTACC
GAAAATACTC TCGCCAATGC CCGCGAGCTG CTGCAAGGAA CGTAA
 
Protein sequence
MLCQLSINNF AIVRFLELDF RPGMTSITGE TGAGKSIAID ALGLCLGNRA DAGAVRPGAS 
KTEVSARFSL DDVPLARRWL EDNDLELDDE CILRRTITSD GRSRAYINGN PVPLSQLKSL
GQYLVGIHGQ HAHHALLKNE HQLTLLDSYA NHRMLQESVA SAYQRCRQIE AELRQLEQSQ
QERIARKQLL QYQVEELDEF ALMEGEFEQI EAEHKRLANG TALIEDCQHT LSLLSESDEG
NIESLLNRAL SLAATLESLD PTLAPIGNML NDALIQVQES SSELGRYLSN LELDPEHFAY
LEERLSKAMT LARKHHVAAD KLAAHHQELA RELAGLDSDE AQLDELKQQV QTSREAYLSH
AQKLSQSRLR YAKELDKQVT ASIQELNMPK GKFTIEVNFN DKVMSANGSD SVEFLVTTNP
GQPLSPLAKV ASGGELSRIG LGIQVITAKK VSTPTLIFDE VDVGISGPTA AVVGRMLRAL
GEATQVFCVT HLPQVAGNGH QHMFVNKSTK GGQTETSMKP LDKDARIQEL ARLLGGDTIT
ENTLANAREL LQGT
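
The protein sequence corresponds to the gene sequence read in frame from its first base under translation table 11. The sketch below illustrates this on the opening 30 bases and the final line of the gene sequence. It assumes the sequence is given on the coding strand and uses only the Python standard library; the helper names `clean` and `translate` are illustrative, not part of any database tool. Table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons, so the sketch builds the codon table from those standard assignments.

```python
# Amino acid assignments for NCBI translation table 11, listed by codon
# with bases in T, C, A, G order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked):
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(blocked.split()).upper()


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
first_blocks = "ATGCTTTGTC AGCTCAGCAT CAATAACTTT"
last_line = "GAAAATACTC TCGCCAATGC CCGCGAGCTG CTGCAAGGAA CGTAA"

print(translate(clean(first_blocks)))  # MLCQLSINNF
print(translate(clean(last_line)))     # ENTLANARELLQGT
```

The two outputs match the first block and the last line of the protein sequence. Every full line of the gene sequence holds 60 bases, a multiple of three, so each line begins on a codon boundary and can be translated on its own.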