Gene YpAngola_A2478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2478 
Symbol 
ID5800948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2594052 
End bp2595143 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content47% 
IMG OID641340352 
Producthypothetical protein 
Protein accessionYP_001606895 
Protein GI162420416 
COG category[R] General function prediction only 
COG ID[COG1073] Hydrolases of the alpha/beta superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000532827 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.854923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACA ATGCTTTTAA AATGACATCC GTTTTTAAAA CGACATCGGT TTTTAAATCC 
ACCTTGTTAG CGGCAGTGAT CAGTCTTTCA TCTGGGGGAG CAATGTCAGC CGATTATAAA
TTAAATCCAT TTACACTGAC TTATGACGGC GCGATTACCG AAAACGTGAA AGGCAAAGTC
AATATTCATC CTGTTACTTA TAAGCTAAAT GGTTTGGAAA TTTCTGCCAA CGTATATACC
CCTGCGAATT ATGATCCCGC TAAAAAATAC CCGGCCGTGG TGGTGGCTCA CCCCAATGGT
GGCGTGAAAG AGCAGGTCGC TGGCTTGTAT GCTCAGCGTT TGGCGGAACA AGGCTATATC
ACCATTACTG CGGATGCCGC GTATCAAGGC GCGAGTGGCG GCTTGCCCCG CAATGTCGAC
AAACCCGCTT ACCGCATTGA AGATATTCAC GGCATGGCAG ACTTTATTAC CCAATATCCT
GGGGTGGACA CGGTCCGTTT AGGCCTGCTG GGTATTTGCG GCGGTGGGGG CTACTCCTTG
AAAGCCGGGC AAACCGACAA GCGCTTCAGC GCGATTGCCA CGTTGAGCAT GTTTGATTCC
GGTCTGGTAA GACGCAACGG TTATCAGGAC TCACAACTTT CCACAATACA AGAACGTTTA
AAACAAGCCT CTGATGCACG AGCACAGGAA GCTGCTGGTG GAGAGGTTAC TTACGTTGGG
GATGCCAAAC TGACTGATGA GCAAATAGCC AAACTGCCGT TTGACCTTTA TCGGCAGGGT
TTTGAGTACT ACGGTAAAAC GCATGCTCAC CCCAATTCAA CCTTCAGATA CACCGCCAGC
AGTCTGCTCG ACTTAATGAG ATTCGATGCG GCAAGCAATA TGGATTTAAT CGATAAGCCG
CTACTGATGA TGGCAGGCAG TAAAGCCGAT TCCCTGTATA TGAGCGAAAC AGCGTTTAAA
GGTGCTAGCA ACGCAAAAGA TAAGGAACTG TTCCTGATTG ATGGTGCGAC CCACATTGAA
ACCTATTGGC AGCCGGAATA TGTGAATCAG GCTATAGGTA AATTGGCTCA GTTCTACGGG
AAAAATTTAT AA
 
Protein sequence
MKHNAFKMTS VFKTTSVFKS TLLAAVISLS SGGAMSADYK LNPFTLTYDG AITENVKGKV 
NIHPVTYKLN GLEISANVYT PANYDPAKKY PAVVVAHPNG GVKEQVAGLY AQRLAEQGYI
TITADAAYQG ASGGLPRNVD KPAYRIEDIH GMADFITQYP GVDTVRLGLL GICGGGGYSL
KAGQTDKRFS AIATLSMFDS GLVRRNGYQD SQLSTIQERL KQASDARAQE AAGGEVTYVG
DAKLTDEQIA KLPFDLYRQG FEYYGKTHAH PNSTFRYTAS SLLDLMRFDA ASNMDLIDKP
LLMMAGSKAD SLYMSETAFK GASNAKDKEL FLIDGATHIE TYWQPEYVNQ AIGKLAQFYG
KNL