Gene YpAngola_A1753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1753 
Symbol 
ID5800224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1815313 
End bp1816437 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content47% 
IMG OID641339687 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001606242 
Protein GI162420801 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0000408747 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAATC AACATCCTGA CTTTGTTTTG CCAAAAGACT TTTGTGCAAA CCCGCGTGAG 
GCTTACACCA TTCCTGCTTA TTTCTATACC CAACAGGCGG CCTTCGAGCA TGAGAAAGAG
AGGGTATTTA CTAATAGCTG GATTTGTATG GCGCACGGCA GTGAAGTGGC GCAGCCTAAT
GATTACATCA CTCGTGAGAT TATTGGTGAG AACATCGTTA TCGTGCGTGG ACGTGATAGC
GTATTGCGGG CTTTTTATAA TGTCTGTCCG CATCGTGGGC ACCAATTACT CAGTGGGGAA
GGTAAAGCAA AGAATGTCAT TACTTGCCCC TATCATGCCT GGACCTTCAA GCTCGACGGT
GAACTGGCAC ATGCCCGTAA CTGCGAGAAT GTCACTAATT TTGATAAAGA TCGGGCGACG
CTTTTCCCGG TGCGCTTAGA GGAATATGCC GGTTTTATTT TCATCAATAT GAACCCAGAC
GCCGAAAGCG TGGAACAACA ATTGCCGGGC TTACAGGATA AAGTTTTCGA AGCGTGCCCC
GATGTGCACG AGTTGAAATT GGCCGCTCGC TTCACCACCC GTACCCCTGC TAACTGGAAG
AATATTGTCG ATAACTATAT GGAGTGTTAT CACTGCGAGC CTGCTCACCC AGGATTCGCC
GATTCAGTGC AGATTGATCG CTACTGGCAC ACCATGCACG GTAATTGGTC ATTACAATTT
GGCTATGCGA AACCCTCAGA AAAGTCCTTT AAATTTGAAG AGGGTGAAGA GTCCTCTTTC
CACGGCTTTT GGCTATGGCC ATGCTCAATG TTCAATGTGC CGCCACTGAA AGGCATGATG
ACCGTAATTT ATGAGTTCCC AGTCGATGCA GAAACGACGC TGCAAAACTA TGATATTTAT
TTTACGAACG AAGAACTGAC GGAAGACCAG AAGGCGCTTA TCGAATGGTA TCGCAATGTC
TTTCGCCCGG AAGATTTACG CTTGGTTGAA AGCGTTCAGA AAGGGCTGAA ATCACGCGGC
TATCGTGGTC AAGGGCGTAT CATGGCTGAT GATAAAGGCA GTGGGATCAG TGAGCATGGT
ATTGCTCACT TCCATAATCT GGTGGCGAAG GTCTTTCAAG AGTAG
 
Protein sequence
MSNQHPDFVL PKDFCANPRE AYTIPAYFYT QQAAFEHEKE RVFTNSWICM AHGSEVAQPN 
DYITREIIGE NIVIVRGRDS VLRAFYNVCP HRGHQLLSGE GKAKNVITCP YHAWTFKLDG
ELAHARNCEN VTNFDKDRAT LFPVRLEEYA GFIFINMNPD AESVEQQLPG LQDKVFEACP
DVHELKLAAR FTTRTPANWK NIVDNYMECY HCEPAHPGFA DSVQIDRYWH TMHGNWSLQF
GYAKPSEKSF KFEEGEESSF HGFWLWPCSM FNVPPLKGMM TVIYEFPVDA ETTLQNYDIY
FTNEELTEDQ KALIEWYRNV FRPEDLRLVE SVQKGLKSRG YRGQGRIMAD DKGSGISEHG
IAHFHNLVAK VFQE