Gene SeD_A3970 details


Gene Information

Locus tag: SeD_A3970
Symbol: yhiR
ID: 6875288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3807421
End bp: 3808263
Gene Length: 843 bp
Protein Length: 280 aa
Translation table: 11
GC content: 57%
IMG OID: 642786927
Product: DNA utilization protein YhiR
Protein accession: YP_002217555
Protein GI: 198242731
COG category: [R] General function prediction only
COG ID: [COG2961] Protein involved in catabolism of external DNA
TIGRFAM ID:
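
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3807421
end_bp = 3808263

# Inclusive coordinates: the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# so it is not counted toward the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 843 280
```

The calculation does not depend on the strand, because the length of the interval is the same in either orientation.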


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 78
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAGTT ATCGTCACAG CTTTCACGCT GGCAACCACG CCGACGTCCT TAAACATACC 
GTTCAGAGCC TGATCATCGA GTCGCTAAAA GAGAAAGAAA AACCGTTTCT CTATCTGGAC
ACGCACGCGG GCGCGGGGCG TTATCAATTG GGCAGCGAAC ATGCTGAACG TACCGGAGAG
TATCTGGAAG GCATCGCCCG TATCTGGCAG CAGGACGATC TGCCCGCCGA ACTGGAACCG
TATATTAGCG TCGTAAAACA TTTCAACCGC AGCGGGCAGT TACGCTACTA TCCGGGCTCC
CCGTTAATCG CCCGCCAGTT GCTGCGTGAG CAGGACAGTC TGCAACTCAC GGAATTGCAT
CCCAGCGACT TCCCACTGTT GCGCGCGGAG TTTCAAAAAG ACAACCGCGC CCGCGTGGAA
CGCGCTGACG GCTATCAGCA ACTGAAAGCC AAATTACCGC CGGTTTCCCG CCGCGGTCTG
ATCCTCATTG ACCCGCCTTA TGAAATGAAA ACCGACTACC AGGCGGTAGT CAGCGGCATC
AGCGAGGGTT ATAAACGTTT CGCCACCGGG ACATACGCGC TATGGTATCC GGTGGTGCTC
CGCCAGCAAA TTAAGCGCAT GGTTCATGAT CTGGAGGCCA CCGGCATCCG TAAAATCCTG
CAAATTGAGC TGGCGATCCG CCCGGACAGC GATCAGCGCG GGATGACGGC GTCCGGTATG
ATCGTGGTTA ACCCGCCGTG GAAACTGGAG CAGCAGATGA ACAACGTGCT ACCGTGGCTG
CACAGCAGGT TGGCGCCAAA CGGTCACGGA CACACTTCGG TAAGCTGGAT CGTGCCGGAG
TAA
 
Protein sequence
MLSYRHSFHA GNHADVLKHT VQSLIIESLK EKEKPFLYLD THAGAGRYQL GSEHAERTGE 
YLEGIARIWQ QDDLPAELEP YISVVKHFNR SGQLRYYPGS PLIARQLLRE QDSLQLTELH
PSDFPLLRAE FQKDNRARVE RADGYQQLKA KLPPVSRRGL ILIDPPYEMK TDYQAVVSGI
SEGYKRFATG TYALWYPVVL RQQIKRMVHD LEATGIRKIL QIELAIRPDS DQRGMTASGM
IVVNPPWKLE QQMNNVLPWL HSRLAPNGHG HTSVSWIVPE
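
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration in plain Python, not the database's own method. It builds the codon table by hand on the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation (the two differ in which start codons are permitted). The helper names `clean`, `translate` and `gc_content` are hypothetical. Only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, copied from the record.
first_row = "ATGCTCAGTT ATCGTCACAG CTTTCACGCT GGCAACCACG CCGACGTCCT TAAACATACC"
print(translate(first_row))  # MLSYRHSFHAGNHADVLKHT
```

The 20 residues produced from the first row match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would extend the same check to the whole record.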