Gene SeD_A4441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4441 
SymbolrhaA 
ID6873309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4283004 
End bp4284263 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content58% 
IMG OID642787359 
ProductL-rhamnose isomerase 
Protein accessionYP_002217970 
Protein GI198245581 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4806] L-rhamnose isomerase 
TIGRFAM ID[TIGR01748] L-rhamnose isomerase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTC AACTGGAACA AGCCTGGGAA CTGGCAAAAC AACGTTTCGC CGCAGTAGGT 
ATTGATGTCG AGGAGGCGCT GCGCCAGCTC GATCGCCTGC CGGTTTCCAT GCACTGCTGG
CAGGGCGACG ATGTTGCCGG ATTCGAGAAC CCGGAAGGTT CGTTGACGGG CGGAATTCAG
TCGACCGGCA ACTATCCGGG CAAAGCGCGT AACGCCACCG AACTGCGCGC CGATCTGGAA
CAGGCGCTGC GTCTGATCCC AGGACCAAAA CGGCTGAACC TGCACGCCAT TTACCTTGAG
TCGGATACGC CGGTTGCTCG CGACCAGATC AAACCGGAGC ATTTTAAAAA CTGGGTGGAG
TGGGCGAAAG CGAACCGGCT GGGGCTGGAT TTCAACCCCA CCTGTTTTTC TCATCCACTG
AGCGCTGACG GTTTTACCCT CTCTCATCCA GACGCGAAAA TTCGCCAGTT CTGGATCGAT
CACTGCAAAG CCAGCCGCCG CGTCTCGGCA TACTTTGGTG AGCAGCTCGG TACGCCGTCG
GTGATGAACA TCTGGATTCC GGACGGCATG AAAGACATTA CCGTCGACCA TTTAGCCCCG
CGCCAGCGCC TGCTGGAAGC GCTGGATGAG GTCATTAGCG AGAAATTCGA CCCGGCGCAC
CACATCGACG CCGTTGAGAG CAAACTGTTT GGCATCGGCG CGGAAAGCTA CACCGTCGGC
TCCAATGAGT TCTACATGGG CTACGCCACC AGCCGTCAGA CCGCGCTGTG CCTGGATGCG
GGCCACTTCC ATCCCACCGA AGTGATTTCC GACAAAATTT CCGCCGCCAT GCTCTACGTA
CCACGGCTGC TGCTGCACGT CAGCCGCCCG GTACGCTGGG ATAGCGACCA CGTGGTACTC
CTGGACGATG AAACCCAGGC AATTGCCAGC GAAATCGTCC GTCATAACCT GTTCGACCGT
GTGCATATCG GCCTGGATTT CTTCGACGTC TCTATCAACC GTGTCGCCGC GTGGGTTATC
GGTACCCGCA ACATGAAAAA AGCGCTGCTG CGCGCTCTGC TGGAACCCAC AGATCAACTG
CGTCAGCTTG AGGCCAGCGG TGATTACACC GCGCGTCTGG CGCTGCTGGA AGAGCAAAAA
TCTCTGCCGT GGCAGGCCGT CTGGGAAATG TATTGCCAGC GTCACGACAC GCCAGTTGGC
AGCCAGTGGC TGGACAGTGT TCGCGCCTAC GAAAAAGAGA TCCTGAGCAA ACGTAGATGA
 
Protein sequence
MTTQLEQAWE LAKQRFAAVG IDVEEALRQL DRLPVSMHCW QGDDVAGFEN PEGSLTGGIQ 
STGNYPGKAR NATELRADLE QALRLIPGPK RLNLHAIYLE SDTPVARDQI KPEHFKNWVE
WAKANRLGLD FNPTCFSHPL SADGFTLSHP DAKIRQFWID HCKASRRVSA YFGEQLGTPS
VMNIWIPDGM KDITVDHLAP RQRLLEALDE VISEKFDPAH HIDAVESKLF GIGAESYTVG
SNEFYMGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PRLLLHVSRP VRWDSDHVVL
LDDETQAIAS EIVRHNLFDR VHIGLDFFDV SINRVAAWVI GTRNMKKALL RALLEPTDQL
RQLEASGDYT ARLALLEEQK SLPWQAVWEM YCQRHDTPVG SQWLDSVRAY EKEILSKRR