Gene Smed_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2666 
Symbol 
ID5323535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2771514 
End bp2772659 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content64% 
IMG OID640791610 
ProductSel1 domain-containing protein 
Protein accessionYP_001328331 
Protein GI150397864 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0163222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.379262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAATCC GCTCCCTCCT GAATTCCAGC CGGCCCGTAG CCTTTGCCGC AGCGATGCTT 
TCAGCAGTCG CGCCGGCACT GGCACAGCAA CCGATACCGC CAGCGCAGAC CGACGAGGGC
AGTGCGCAGA AGCGCGGGCG GATCACGCCT TTCAACGGCG CGGCACTACC GGAAGATGGC
GAGCGAAAGC CGGCAGCGGC CGGGCCGGAA AAGCCGAAGC CCAGCGATGG CAGTGCGCCT
TCGAAGGGCG TCAACGTGAT CGATCGCATG GGAGCAGAAT TGCCTGATCT TCCTGAGGAG
AAACCCTTCA CAGGCAAGAT CGACGAAGCT TATGGTGCCT TCCAGCGGGG CTATTACCTC
ACGGCGATGG ACCTTGCCTT GCCGCGGGCC CAGCTCGGCG ATCCAGCCGC CCAGACGCTC
GTCGCGGGAA TCCTCGAGCA GGGACTCGGT GTTGCGCGCG ACGCCAAGGC CGCCGCCTTC
TGGTACGGCC AGGCGGCCAC CAATGGCGAT CCGGCGGCCA TGTTCAAATA TGCGCTGATC
CTGATGGAAG GCCGCCACGT CAAGCGCGAC CGGAAGAAAG CAGACGAGCT CATGAAAAAG
GCTGCCGATC TCGGCAATGC CTCGGCTCAG TTCAACTACG GACAAACGCT GGTGGCCGAC
ATGCCCGGCG AGCGCGGCCT GAAGGCCGCC ATGCCCTATT ACGAGAAATC GGCCGAACAA
GGCATTGCGG ACGCGCAATA TGCGCTGTCC CAGATCTACG TCAATGTCGA CGGCGTCGAG
GACGACAAAC GCGCCCGCGC CCGCGAGTGG CTGCTCAGGG CGGCGCGCGC GGGCTATGAT
ACGGCGCAGC TCGACATTGC GATCTGGCTG ATAGAAGGGA TCGCCGGCGA CCGCAACCTC
GAAGAGGGCT TTGCCTGGAT GAAGCGCGCC GCCGAAAGCG GCAACGTCGT CGCCCAGAAC
CGACTCTCCC ACCTCTATGT GAATGCCATA GGTACGCGTC CGGACCCCGT CGAAGCGGCA
AAATGGTACG TCCTGTCGCG CCGGGCCGGC CTCAAAGATG ACGCGCTCGA GGATTTCTAT
CTCGGCCTCA ACGAAACGCA GCAGAAGTCG GCGCTCGCGG CGGCCAACAA ATACCGCTCG
TCCTGA
 
Protein sequence
MVIRSLLNSS RPVAFAAAML SAVAPALAQQ PIPPAQTDEG SAQKRGRITP FNGAALPEDG 
ERKPAAAGPE KPKPSDGSAP SKGVNVIDRM GAELPDLPEE KPFTGKIDEA YGAFQRGYYL
TAMDLALPRA QLGDPAAQTL VAGILEQGLG VARDAKAAAF WYGQAATNGD PAAMFKYALI
LMEGRHVKRD RKKADELMKK AADLGNASAQ FNYGQTLVAD MPGERGLKAA MPYYEKSAEQ
GIADAQYALS QIYVNVDGVE DDKRARAREW LLRAARAGYD TAQLDIAIWL IEGIAGDRNL
EEGFAWMKRA AESGNVVAQN RLSHLYVNAI GTRPDPVEAA KWYVLSRRAG LKDDALEDFY
LGLNETQQKS ALAAANKYRS S