Gene YpAngola_A2053 details


Gene Information

Locus tag: YpAngola_A2053
Symbol: uvrC
ID: 5800523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2139470
End bp: 2141323
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 45%
IMG OID: 641339973
Product: excinuclease ABC subunit C
Protein accession: YP_001606522
Protein GI: 162420815
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
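
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence on this page ends in TGA). The function names are illustrative only and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = feature_length(2139470, 2141323)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1854 617
```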


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000011478
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.000473429
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGAGAC GTTATCAAAT AGTGACTGAT CTTTTTGATT ATAAAGAATT TTTGAAAACT 
GTTACCAGTC AGCCTGGTGT TTATCGGATG TACGATACTG CAGGAACAGT TATTTATGTC
GGTAAAGCAA AAGATCTCAA AAAGCGTCTG ACGAGCTATT TTCGTGCTCA GGTCGCCAAC
CGAAAAACTG AAACGTTAGT TAAAAATATT GCTCAAATTG ATGTCACCGT TACGCATACT
GAAACGGAAG CATTGTTGCT TGAGCACAAT TACATCAAGC TTTATCAGCC ACGTTACAAT
GTGTTGCTAC GTGATGATAA GTCATATCCG CTTATTTTCT TGAGTGCGGA TGAGCACCCG
CGCCTTGCTG TGCACCGTGG CGCAAAACAT GAGAAAGGGG AGTATTTTGG GCCGTTTCCA
AACTCCTATG CAGTGCGTGA AACGTTGGCA TTATTACAGA AGCTTTTTCC AGTAAGGCAA
TGTGAGAATA GTGTTTACCG CAATCGCTCA CGGCCTTGCC TACAATATCA GATTGGGCGT
TGTTCCGGGC CTTGTGTGGA GGGGTTGGTA AGTGAAGAGG AGTATCAGCG CCAAGTTGAT
TACGTTCGCT TGTTCCTCTC AGGTAAAGAT CAGCAAGTCT TGACGCAGCT AATCACCCGT
ATGGAGGAGG CTAGCCAGCA ACTGCATTTT GAAGATGCAG CACGGATTCG CGATCAAATC
CAAGCTGTGC GTCGAGTAAC GGAACAACAG TTTGTTTCTG GCGACAGTGA AGATCTTGAT
GTTATCGGTG TGGCATTTGA TGCGGGGCTC GCTTGTGTCC ATGTACTGTT TATCAGATTA
GGGAAAGTAT TAGGTAGCCG GAGTTATTTC CCTAAAGTTC CCGCGGGTAC TGAGTTGAGC
GAAGTCGTTC AGACTTTTGT TGGGCAGTTT TATTTGCAAG GAAGCCAAGG GCGTACATTG
CCTGGTGAAA TTTTACTGGA TTTTACCCTC ACAGAAAAAG ACCTGTTGGC ATCTTCTCTT
TCTGAACTGG CTGGGCGAAA AATTCAGATA CAAAGCCGCC CCCGAGGAGA TCGTGCGCGT
TATCTCAAGT TAGCACGTAC CAATGCTTCA ACCGCATTGA TAACCCGCTT ATCTCAGCAA
TCGACTATTC ATCAACGCAT GAAGGAATTA GCCAAGGTCC TTAAGCTTGA TGAAATTAAT
CGTATGGAAT GTTTTGATAT CAGCCATACG ATGGGGGAAC AGACCGTAGC CTCTTGTGTC
GTATTTGATG CGAATTGTCC TGTACGATCT GAATATCGGC GCTACAATAT TAGTGGTATC
ACGCCGGGTG ATGATTATGC AGCGATGGCT CAAGTACTAA AACGTCGATA TGGGAAAGCA
TTAGACGATC AAAAAATTCC TGATGTCATA TTTATTGACG GTGGGAAGGG GCAGTTGTCA
CAGGCCTTTG ATGTTTTTGC CTCGTTGAAT GTTCCGTGGG ATAAGCAAAA GCCATTGCTA
GTCGGTGTAG CGAAAGGTAG TGATCGCAAA GCGGGTTTAG AAACATTATT CTTGGCATCA
GAAGGTGAGG GCTTTTCTTT ACCCCCAGAT TCACCAGCAT TACATCTGAT CCAGCATATT
CGTGATGATT CTCATAATCA TGCGATAACC GGGCACCGGC AGCGGCGATC TAAAGTAAAA
AATACCAGTG CGTTAGAAAT GATAGAAGGT GTTGGCCCCA AACGACGGCA AGTTTTGTTG
AAATATATGG GTGGACTACA ACCTTTGTTT AACGCAAGCG TCGAGGAAAT TGCAAAAGTG
CCGGGTATTT CACAAGCATT GGCAGAAAAA ATCCACAATG CATTGAAACA CTGA
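
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted in as plain text in the 10-base blocks shown here; `gc_content` is a hypothetical helper name, and the example call uses only the first block of the sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    # Remove the spaces and line breaks used to lay the sequence out in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence, used only as a small illustration.
first_block = "ATGCGGAGAC"
print(f"{gc_content(first_block):.0%}")  # 60%
```

Passing the complete gene sequence to the same function would give the whole-gene value to compare against the figure in the Gene Information section.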
 
Protein sequence
MRRRYQIVTD LFDYKEFLKT VTSQPGVYRM YDTAGTVIYV GKAKDLKKRL TSYFRAQVAN 
RKTETLVKNI AQIDVTVTHT ETEALLLEHN YIKLYQPRYN VLLRDDKSYP LIFLSADEHP
RLAVHRGAKH EKGEYFGPFP NSYAVRETLA LLQKLFPVRQ CENSVYRNRS RPCLQYQIGR
CSGPCVEGLV SEEEYQRQVD YVRLFLSGKD QQVLTQLITR MEEASQQLHF EDAARIRDQI
QAVRRVTEQQ FVSGDSEDLD VIGVAFDAGL ACVHVLFIRL GKVLGSRSYF PKVPAGTELS
EVVQTFVGQF YLQGSQGRTL PGEILLDFTL TEKDLLASSL SELAGRKIQI QSRPRGDRAR
YLKLARTNAS TALITRLSQQ STIHQRMKEL AKVLKLDEIN RMECFDISHT MGEQTVASCV
VFDANCPVRS EYRRYNISGI TPGDDYAAMA QVLKRRYGKA LDDQKIPDVI FIDGGKGQLS
QAFDVFASLN VPWDKQKPLL VGVAKGSDRK AGLETLFLAS EGEGFSLPPD SPALHLIQHI
RDDSHNHAIT GHRQRRSKVK NTSALEMIEG VGPKRRQVLL KYMGGLQPLF NASVEEIAKV
PGISQALAEK IHNALKH