Gene YpAngola_A4004 details


Gene Information

Locus tag: YpAngola_A4004
Symbol:
ID: 5802484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4258526
End bp: 4259569
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 53%
IMG OID: 641341791
Product: hypothetical protein
Protein accession: YP_001608298
Protein GI: 162418794
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03558] luciferase family oxidoreductase, group 1
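
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 1044 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4258526, 4259569)
print(length, protein_length_aa(length))  # → 1044 347
```

Both values agree with the Gene Length and Protein Length fields of the record.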


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGATA ACACCGGTTC TGTAGTCGAT AAAAATACTG TCCCCCTTTC TGTGCTCGAT 
TTATCCCCGA TAGCACAGGG TAAAACAGCA CGGGATGCAT TTCATGCTTC ACTGGATTTA
GCACAACATG CTGAAAAATG GGGATATCAG CGCTACTGGC TCGCTGAACA TCACAATATG
ACCGGTATCG CCAGCGCGGC GACGTCGGTG CTGATCGGCT ATATTGCGAG TGGCACGAAT
AAGATCCGCG TCGGCGCTGG TGGTGTCATG CTGCCGAACC ATTCACCGCT GGTGATTGCT
GAGCAGTTCG GTACCCTCGC GTCGCTCTAC CCTGATCGTA TCGATCTCGG TTTGGGCCGT
GCACCCGGTA GCGATCAACG CACCATGATG GCATTACGTC GCCATCTCTC TGGTGAAGTG
GATAATTTCC CAGCCGATGT GCGTGAACTG CAGAATTACT TTGCCGAAGT ACAACCGGGG
CAAGCAGTAC AAGCAGTGCC AGGCCAGGGG CTGCATGTTC CGCTGTGGCT ACTAGGCTCC
AGTCTGTATA GCGCACAACT GGCCGCCGCG ATGGGCCTGC CGTTTGCCTT TGCCTCACAT
TTCGCCCCAG ATATGTTGTT ACAGGCACTC TCGCTATACC GTGAGAACTT TACACCATCC
GCTCAATGGC CAAAGCCCTA TGCCATTGTC TGCGTAAACG TGGTCGCGGC AGATAGCGAG
CGCGATGCAC GCTTCCTGTT TACCTCAATG CAGCAACAAT TTGTCAGCCT GCGCCGGGGG
ACACCGGGCC AGTTACCACC GCCAGTGGAG AATATGGCGG CAATTTGCTC GCCAGCGGAA
CAGTTTGGTG TCGATCAGGC GCTGCGCTTA TCCATCGTCG GTGATAAAAG TAAAGTACGC
CATGGGCTTC AGTCATTATT ACGGGAAACA CAGGCTGACG AATTGATGAT TAATGGCCAA
ATTTTTGATC ATCAAGCACG GCTGTATTCC TTTGAAACCG TTGCCAGCTT ACAGCAAGAT
CTGATGCATA CCCCGCGCCG GTAA
 
Protein sequence
MIDNTGSVVD KNTVPLSVLD LSPIAQGKTA RDAFHASLDL AQHAEKWGYQ RYWLAEHHNM 
TGIASAATSV LIGYIASGTN KIRVGAGGVM LPNHSPLVIA EQFGTLASLY PDRIDLGLGR
APGSDQRTMM ALRRHLSGEV DNFPADVREL QNYFAEVQPG QAVQAVPGQG LHVPLWLLGS
SLYSAQLAAA MGLPFAFASH FAPDMLLQAL SLYRENFTPS AQWPKPYAIV CVNVVAADSE
RDARFLFTSM QQQFVSLRRG TPGQLPPPVE NMAAICSPAE QFGVDQALRL SIVGDKSKVR
HGLQSLLRET QADELMINGQ IFDHQARLYS FETVASLQQD LMHTPRR
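
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a hedged example rather than the method used to produce this record: it builds the codon table by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The names `clean`, `translate`, and `CODON_TABLE` are introduced only for this illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGATTGATA ACACCGGTTC TGTAGTCGAT"))  # → MIDNTGSVVD
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would be the natural way to check the remaining residues.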