Gene Smed_2968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2968 
Symbol 
ID5323845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3114373 
End bp3115761 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content66% 
IMG OID640791919 
Producthypothetical protein 
Protein accessionYP_001328632 
Protein GI150398165 
COG category[S] Function unknown 
COG ID[COG4223] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAAC GACGGGGTGT GCCTGCTGCA GGTCCGCTCC GGCGAGGCCG TCAGGCAAAA 
AGGCGGTCCA ATAGGCAAAC TTATGAGGTG ACCATGGCAT CGGAAAAGCC GCCGCGCCGC
TCGAAAGCTG GAAAAGAACC GCTGACGATC GATCTCGAGG CGGAGAAAAC CGCGTCCGCG
CAGACCGAAG CCACCGCTGA CGGGGAAGCC GCACCCAGCC AGCCGGTGGA CATGCCGCCG
AGCGAGGCGG CGGCAGCCGG TGAAGCAGGC ACCGGCGGCG CCCGAAGCCG CGAAGGCGGC
ACCGGCGAAC CTCCATTGCC GGCAGGCGAT CCGCAAGAGG AAACGGAAGA GGCACGGGCA
GATGCCGCCG CTGCCGCTTT CGACGAGAAG CTTCCGCATT CGAGCAGCGG CGAAATGCCT
GAGCCGAAGA GGAGGCAGAC TGCGGGTGCA GGCGCGCTTG CAGCCGGCAT TCTCGGCGGC
CTCGTCGCGC TCGCCGGTGC CGGCGTGTTG CAATATGCCG GCTACATTCC TGCGCCCGGA
CCTGAGCGGC CTGGCACCGA GCAGAACCTT GCGGGCGAAA TCGAAGCGAT AAAGGCCGAG
CTCCGGGCTC AGGCTCCGGC GGCACCAGTC GACGTCGCCC CGCTGGAAAA CCGCCTTGCC
GCCCTGGAAA ATGCGGCGCG GGAGCCCGGC GCTGATGCGG AGGGCTCGCC TGAGGTCAAA
TCGCTCGAAG CCGAGGTCGC CAATCTGACG ACCGAAATCG CCACGCTCAA AACGGAGCTT
GCCGAAACCC GCCAGGCGGC CGACGCCGCT CGAGCAGAGC TTGCCGGCCG TATCGACCAG
GCGGAACAAA AGCTCAACGA GCCGGCAAAC GACATCGATA TGGCGAAAGC CGTCGCGGTG
ACGGCACTGA AGACGGCGAT CGATCGCGGC GGGCCGTTCC TCGCAGAACT CGACGCCCTG
CGCAGCATCG CGCCCGACGA TCCGGCGGTC AAGGAACTGG CGGCTGTAGC CGCAACGGGG
GTCGCCACGC GCGCGGCTCT CCGGGATAGC TTCCAGCCGG CGGCCGACGC GATGCTGAAT
GCGCTGCAGC AGCCCGATCC CAACCAAGGG ATCTTCGATC GCCTCGTCTC CAGCGCCATG
TCGGGGATCC GTGTTCGGCC GGTCGGCAGC GTCGAGGGCG ACACGCCGGA AGCGGTGATC
GCACGCATCG AGGACAAGCT CGACAATGGC GATCTCAAGG GCGCATCGCT CGAATGGAAC
AGCCTTCCCG AGGCGGCGAG ATCAGCCGGC CAGAAATTCA AGGAGAAGCT TGACCGGCGG
CTCAACGTGG AAACGGTGAT CGATGCCGCC GTCGCCGGGA CCATGGTTCG AACCGGTACA
CAAGGTTAG
 
Protein sequence
MLKRRGVPAA GPLRRGRQAK RRSNRQTYEV TMASEKPPRR SKAGKEPLTI DLEAEKTASA 
QTEATADGEA APSQPVDMPP SEAAAAGEAG TGGARSREGG TGEPPLPAGD PQEETEEARA
DAAAAAFDEK LPHSSSGEMP EPKRRQTAGA GALAAGILGG LVALAGAGVL QYAGYIPAPG
PERPGTEQNL AGEIEAIKAE LRAQAPAAPV DVAPLENRLA ALENAAREPG ADAEGSPEVK
SLEAEVANLT TEIATLKTEL AETRQAADAA RAELAGRIDQ AEQKLNEPAN DIDMAKAVAV
TALKTAIDRG GPFLAELDAL RSIAPDDPAV KELAAVAATG VATRAALRDS FQPAADAMLN
ALQQPDPNQG IFDRLVSSAM SGIRVRPVGS VEGDTPEAVI ARIEDKLDNG DLKGASLEWN
SLPEAARSAG QKFKEKLDRR LNVETVIDAA VAGTMVRTGT QG