Gene YpAngola_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_0037 
Symbol 
ID5798354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010158 
Strand
Start bp26139 
End bp28055 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content53% 
IMG OID641337937 
ProductRecF/RecN/SMC domain-containing protein 
Protein accessionYP_001604554 
Protein GI162417866 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR03185] DNA sulfur modification protein DndD 


Plasmid Coverage information

Num covering plasmid clones120 
Plasmid unclonability p-value0.0000000000359393 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones440 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTC TAAAGCTCCA GGTTGAGAAC TTCATGGCGT TAGCCAGCGC CGAAGTTGAG 
TTAGACCAAC GCGGTCTGGT GCTCATTCAG GGTGTTAACA GTGGCGACTC TTCCGCTGCC
AGCAATGGCG CGGGCAAATC GACTTTGATG AACAGCCTGA TGTGGTGTCT GTATGGCGAA
ACTGCGCATG GCGTCAAAGG TGACGACGTG CTGTCTACAG GTCACGAAAA AAACTGTCGT
GTGATGGTAA CTGTTGAGGA TGAAGGAAAG CGTTACGCCA TCATTCGCCA CCGCAAACAC
AAAGAGTTCA AGAACCGGCT GATCGTCCGT GGCGAAGACG GTGACATGAC CAAAGGCAAA
GACACACTGA CGCAGGAGTT CGTTGAACGC CTGATTGGTG CATCGAAAGA GGTGTTCATG
GCGTCCATCT ACGCCAGTCA GGAAGCAATG CCAGATCTGC CGGGTATGTC CGACAAGAAC
CTCAAAACCA TCGTTGAAGA AGCCGCTGGC GTCGACCGGT TAACGCGAGC CTATGCCATT
GCTCGCGAGC GTGCTAATGC AGCTGCCGCA CGCATGGATG TTACCAAATC CAAAATGGAC
GCCTGTCTCA CGCTTATCGA GACCGCGCAG TCAGAGATTG AGGCGGCCAA AGCGTCCTCT
GATAGTTGGG AACGCGATCG CGGCGAACGT CTGGACAAGG CCCGCGTAGA TTTGGCTGGC
GCGGAGGTAA CGCTGTCTGA AGTCGTGATG GAAATTCGCT CGCTGCCGGA ACAGATCCGG
GATACGGAAA ACGCGATTGC TGGCGAACGC TGCAAGCTGG CCTCCAAAGA AGAGCATGAC
GCCAAACTGC TGAAGGTGCG CGGTGCGATT ACGGAGATCC GCTCAAGCAT CCACACTTCA
GAAGCGGCAC AGAACGAGTC GATGAACCGT GCTCGCTCGT TTAAAACCAA AGCAGAAGAG
GTCAGCACAA AGGTCGGAGC ACCTTGTGTT ACTTGCGGAA AGCCCTACTG CGAAGAAGAT
TTGTCCACCG TGAAGGAGAG TTTCATTGAA CAAGCGCGTA ATGAGATCGG CCAGGCGCAA
GCATCAGCTT CGGCAGTGGC TCAACACAAA GCTCGTCTTG AGAAAGCGCT CGGCATCGAA
TCTGCACTGG TCGCAGCCAC ACCCGACGTT TCAGAAATCA TCGCCAAAAT CGAACGCCTG
ACCAATGAGC TAAGTGCGCT GCGTCATCGC GAACGTGAAG TTGTGGCCGT CGAAGCGATG
GTGGCGCGGG CGCGTACCGA TGTGAATCGC ATTATGGCAG AGGTAAACCC ATTTCTGGCC
GTTATTAAGC GTCATGAGGA CAACCTGGCT GCCAATAAAT CTAATCATGC AGTACTTAAA
AATGAGTTAA AGAGTATTCA AGAACAGGCT CTGTTGCTGG AGAAGGCTCG CCAGGTTTAC
TCCCCTGCAG GTGTTCGTTC ACACATCCTG ACCTCCGTTA CGCCTTTCCT GAACATCAGG
ACTGCGGAGT ATCTCAACAC GCTATCGGAC GGCAATATCG TTGCCGAATG GTCGACAATG
GAGACAACGA AGAAAGGCGA GTATCGCGAC AAATTCAATA TAAGCGTGAC CAAAACAGGT
TCCAGCAAAT CCTTCCAGAC GTTGTCTGGT GGTGAGAAGC GTAAGGTACG TATTGCGTGC
TCTCTAGCCT TGCAGGATCT GGTTGCCAGT CGCGCCAGTA AGAATATCGA GCTGTTTATC
GGCGATGAAA TTGACGACGC GCTCGACACT GCCGGTCTGG AGCGTCTCAT GGGGATTCTG
GAAGCCAAAG CGCGTGAACG CGGCACAGTG ATGATCATCT CCCACAAAGA GATGAAATCG
TGGTTCCGGG AAACCATCAC TGTCGAAGTC AAAGAGGGTC GCAGCTATGT CGTTTAA
 
Protein sequence
MKFLKLQVEN FMALASAEVE LDQRGLVLIQ GVNSGDSSAA SNGAGKSTLM NSLMWCLYGE 
TAHGVKGDDV LSTGHEKNCR VMVTVEDEGK RYAIIRHRKH KEFKNRLIVR GEDGDMTKGK
DTLTQEFVER LIGASKEVFM ASIYASQEAM PDLPGMSDKN LKTIVEEAAG VDRLTRAYAI
ARERANAAAA RMDVTKSKMD ACLTLIETAQ SEIEAAKASS DSWERDRGER LDKARVDLAG
AEVTLSEVVM EIRSLPEQIR DTENAIAGER CKLASKEEHD AKLLKVRGAI TEIRSSIHTS
EAAQNESMNR ARSFKTKAEE VSTKVGAPCV TCGKPYCEED LSTVKESFIE QARNEIGQAQ
ASASAVAQHK ARLEKALGIE SALVAATPDV SEIIAKIERL TNELSALRHR EREVVAVEAM
VARARTDVNR IMAEVNPFLA VIKRHEDNLA ANKSNHAVLK NELKSIQEQA LLLEKARQVY
SPAGVRSHIL TSVTPFLNIR TAEYLNTLSD GNIVAEWSTM ETTKKGEYRD KFNISVTKTG
SSKSFQTLSG GEKRKVRIAC SLALQDLVAS RASKNIELFI GDEIDDALDT AGLERLMGIL
EAKARERGTV MIISHKEMKS WFRETITVEV KEGRSYVV