Gene YpAngola_A1728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1728 
Symbolcas1 
ID5800199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1782294 
End bp1783274 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content48% 
IMG OID641339666 
Producthypothetical protein 
Protein accessionYP_001606221 
Protein GI162418150 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00792911 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.000413169 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAACG CTATTCATTC CTCTGATTTG AAAACGATCC TGCATTCAAA ACGATCCAAT 
ATTTACTATT TAGAATATTG CCGTGTATTG GTTAATGGTG GGCGAGTTGA ATATGTCACC
GATGAAGGTA AACAATCCCT TTACTGGAAT ATCCCCATAG CGAACACCAC CGTTATTATG
CTGGGAACCG GGACTTCGGT GACTCAGGCT GCTATGCGTG AGTTTGCCCG AGCCGGGGTC
TTAGTCGGTT TTTGTGGCGG GGGTGGGACG CCGCTTTTTG CGGCTAATGA CGTAGAGGTC
AATGTCTCGT GGCTCACTGC ACAAAGCGAA TACCGGCCAA CCGAGTATCT GCACGATTGG
GTCAGTTTCT GGTTCGATGA TGAAAAAAGA CTGGCAGCAG CAGTGGCTTT CCAGCGCATC
AGGATCGCCC AAATTCAACA ACATTGGCTC AGCAGCCACA TACAGCGCGA ATCTCTTTTT
CCGGTTAATC ACGATCAATT ATTATTTATC CTCACCCGTT TTGAGCAAAA TTTAGCAAAT
TGTCTCACCA GTAATGACCT TATGGTTCAG GAAGCGGTAT TAACAAAGGC ACTCTATAAA
CTGGCTGCTA ATACAGTGAA TTACGGCGAT TTCACCCGCG CTAAACGCGG TGGGGGCATC
GATCTAGCTA ATCGTTTTCT CGATCACGGA AATTATCTCG CCTATGGCTT AGCGGCGACG
GCGACATGGG TTATTGGCTT ACCCCATGGT CTGTCTGTTT TACACGGTAA GACCCGGCGT
GGTGGTTTGG TCTTTGATGT GGCCGATTTA ATTAAAGATG CGCTGGTGCT ACCGCAGGCA
TTTATTGCCG CCATGCAGGG AGAAGAAGAG CAAGAATTTC GTCAGCGCTG CATTAGCGGG
TTTCAACGAA CCGAAGCGCT GGATGTGATG ATTGATGGAA TAAAAGAAAC GGCAGCGTTA
TGTAGCCAGG TTCCGCGATG A
 
Protein sequence
MENAIHSSDL KTILHSKRSN IYYLEYCRVL VNGGRVEYVT DEGKQSLYWN IPIANTTVIM 
LGTGTSVTQA AMREFARAGV LVGFCGGGGT PLFAANDVEV NVSWLTAQSE YRPTEYLHDW
VSFWFDDEKR LAAAVAFQRI RIAQIQQHWL SSHIQRESLF PVNHDQLLFI LTRFEQNLAN
CLTSNDLMVQ EAVLTKALYK LAANTVNYGD FTRAKRGGGI DLANRFLDHG NYLAYGLAAT
ATWVIGLPHG LSVLHGKTRR GGLVFDVADL IKDALVLPQA FIAAMQGEEE QEFRQRCISG
FQRTEALDVM IDGIKETAAL CSQVPR