Gene Smed_2934 details

Gene Information

Locus tag: Smed_2934
Symbol:
ID: 5323811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3080586
End bp: 3081677
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 62%
IMG OID: 640791885
Product: GumN family protein
Protein accession: YP_001328598
Protein GI: 150398131
COG category: [S] Function unknown
COG ID: [COG3735] Uncharacterized protein conserved in bacteria
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACCT TCACCAGACC CGTCCGCGAT CTCGCGGCCA AGGCGACCGA CTGCCTGTTG 
TGGCTCACGG CATCGTTCCA CATACTGATG GCCGCGAGCC TCCTTGCCGC CCTGCTCTAT
GTCTCGGAGG CTCGGGCGGA TGAGACGGAT TGCGTCGGCA GCAATATTTT GACCGGACTC
GAAAATTCTG ATCCTGCCCG TCTGGCGGCT TTGCGTGCGG AAGCGGCGGC GATACCCAAC
GGAAAGGGCC TGCTCTGGCG AATCGAAGAT CCCGCCGCTC AAGGCCAGGG ACGGGCGCCC
TCCTTTCTGC TCGGCACCAT GCATGTCAGC GACCCGCGCG TGCTGGCGAT GCCTGGCGGG
GCGGCGCAAG CCTTCGCAAA AGCGCGGACC GTCATCGTGG AATCCGACGA GATTATCGAT
CAGAACCGGG CGACCGCCGC GATCATGATG CGGCCCGATC TGACCATGTT CACCGGCGAC
AAGACGATCA ACGACTTCCT GAAGCCGGAG GACCTCGCCC TTCTCGAAGG CGGACTGAAG
GCTCGCGGCA TCCCCCTGCC CCTCGTCACC AGGATGAAGC CCTGGATGAT CGCCAGCTTC
GTGGCCCTGC CGGCCTGCGA ATTCTCACGC AAGGCGGCCG GCGCCTCCTT CCTCGATAAG
AAGCTCGCCG AGGACGCGGT GAGGGAGGGC AAGACGCTCA AGGGGCTCGA AACGCTGGTC
GAACAGCTTG CGGCAATGGA TTCTCTGCCG GTCGAACTGC ACTTGAAGGC ATTGATCGAA
ACGCTCGCTC TCGGCAAGAC GATCGACGAC GTGTTCACGA CGACCACCGA TCTCTATCTT
TCCGGCGAGA CGGGCACCAT CATGCCCATG ATGAAACTGG TCTCCGCCGG GCTTTCGCCT
AATGATGCCG GCTATGCCGA ATTCGAGCAA AGGATCGTCG TCGACCGCAA CAGGATCATG
GCGGACCGTG CCGGACCTAT CCTGAGGGAC GGCGGCGCCT TCATGGCCGT GGGCGCGCTG
CATCTTCCGG GCAAGGAAGG CCTGGTCGAA CTTCTGCGGC AAGAGGGTTT TAAGGTTACG
CGGGAGGAAT GA
 
Protein sequence
MITFTRPVRD LAAKATDCLL WLTASFHILM AASLLAALLY VSEARADETD CVGSNILTGL 
ENSDPARLAA LRAEAAAIPN GKGLLWRIED PAAQGQGRAP SFLLGTMHVS DPRVLAMPGG
AAQAFAKART VIVESDEIID QNRATAAIMM RPDLTMFTGD KTINDFLKPE DLALLEGGLK
ARGIPLPLVT RMKPWMIASF VALPACEFSR KAAGASFLDK KLAEDAVREG KTLKGLETLV
EQLAAMDSLP VELHLKALIE TLALGKTIDD VFTTTTDLYL SGETGTIMPM MKLVSAGLSP
NDAGYAEFEQ RIVVDRNRIM ADRAGPILRD GGAFMAVGAL HLPGKEGLVE LLRQEGFKVT
REE
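
Consistency check (illustrative)

The figures in this record can be cross-checked against each other: the gene length should equal End bp - Start bp + 1, and the protein length should be one codon fewer than the gene length divided by three, because the final codon (TGA) is a stop. The sketch below is a minimal, hypothetical helper and not part of the source database. The function names (clean, gene_length, gc_fraction, translate) are assumptions made for illustration. It uses the standard codon-to-amino-acid mapping, which translation table 11 shares for all 64 codons, and it does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Standard codon mapping in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


length = gene_length(3080586, 3081677)
print(length)                                  # 1092
print(length // 3 - 1)                         # 363 (stop codon excluded)
print(translate("ATGATCACCT TCACCAGACC C"))    # MITFTRP
print(translate("CGGGAGGAAT GA"))              # REE
```

Passing the full gene sequence above to translate and gc_fraction would allow the listed protein sequence and GC content to be checked in the same way.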