Gene Smed_5364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5364 
Symbol 
ID5319666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp330875 
End bp332026 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content62% 
IMG OID640777136 
ProductHPP family protein? 
Protein accessionYP_001314068 
Protein GI150377473 
COG category[T] Signal transduction mechanisms 
COG ID[COG3448] CBS-domain-containing membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.344775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.280172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGTTC CGCCTCCACC CGATAATGTT CAGAAGCCAG GGTCTTCGCC CAGGTTTAAG 
CTGTTTTCCC CGATCCTGGC GGGCGCAACG CTGAAAGAGC GCCTTATCGG CTGTCTCGGA
GCGCTGATCG GCATCTGTTT GACCGGTCTC GTCTGCGGGT TCATCTTCGG CGACGATCCT
CAGCTGCCGT TGATCGTTGC CCCCATCGGC GCTTCTGCGG TCTTGCTCTT TGCGGTTCCG
GCCAGTCCGC TTGCGCAGCC CTGGTCGATC ATCGGCGGGA ACACGATCTC CGCCCTGATC
GGCGTCACCG TGAGCTATTT CGTGAAGGAC CAGATGGTCG CCATCGGCCT TGCGGTTGCC
CTGGCGATCC TTGCCATGTC GCTCACGCGG TCGCTTCACC CCCCTGGAGG CGCCGCCGCG
CTGACAGCGG TGATAGGTGG AGCGGCGATC GCGCGCGCGG GTTTCTGGTT CCCGTTCATA
CCGGTAGCCA TCAACTCCCT GATCCTTGTG GGATTGGGCA TCGTATTCCA CCGGATGGCG
CGGCGCCAAT ATCCGCACCG ACCGGCTGTT GCACCAGTGA ACACGCATGA AACGGCCGAT
CCCCCGCCTG CGCTTCGGGT CGGCTTCAAT TCCGAGGACA TTGATCTCGC AATAGCACGC
TTGAACGAGA CACTCGACGT CAGCCGCGCG GACATCGATG CTCTCTTGAG GGAAGTCGAA
CTGCAGGCCC TCATCAGACA GAGGGGAGAG CTGACATGTG CCGACATCAT GTCCCGCGAC
GTCGTAACCG TTCCGGCCGA CACGACGCCG GACCATGCAC GATACCTCTT GTTGAAGCAT
GATATTCGAA CACTTCCCGT GCTCGACGAA AACGGGAAAC TGCAAGGGAC GGTCGGCCTG
CGAGAGCTGG CCGGCAAGGA ACCCGGCAGC AAACTGCCGA TCGCCGTGGC GGCCACCGCC
AACCCGTCCG ACCCTGCGAT CAGCCTGCTC CCCCGCCTGA CGGACGGCAT GACACATGCC
GTCGTCATAC TGGACGACGA CGAAAAGGTC GTGGGCATCA TATCCCAAAC CGACCTGCTC
GCGACTTTGG CCAAAAGCAT CTCCCAGAAC GGTGCGTCCG AGATCATGCG AGGCCACGGA
CAGGGCATCT AG
 
Protein sequence
MIVPPPPDNV QKPGSSPRFK LFSPILAGAT LKERLIGCLG ALIGICLTGL VCGFIFGDDP 
QLPLIVAPIG ASAVLLFAVP ASPLAQPWSI IGGNTISALI GVTVSYFVKD QMVAIGLAVA
LAILAMSLTR SLHPPGGAAA LTAVIGGAAI ARAGFWFPFI PVAINSLILV GLGIVFHRMA
RRQYPHRPAV APVNTHETAD PPPALRVGFN SEDIDLAIAR LNETLDVSRA DIDALLREVE
LQALIRQRGE LTCADIMSRD VVTVPADTTP DHARYLLLKH DIRTLPVLDE NGKLQGTVGL
RELAGKEPGS KLPIAVAATA NPSDPAISLL PRLTDGMTHA VVILDDDEKV VGIISQTDLL
ATLAKSISQN GASEIMRGHG QGI