Gene Smed_3953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3953 
Symbol 
ID5318061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp403199 
End bp404368 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content62% 
IMG OID640775763 
Producthypothetical protein 
Protein accessionYP_001312696 
Protein GI150376100 
COG category[S] Function unknown 
COG ID[COG3287] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTG TCGATGATCT TGAGCGCAAA CCGGCGGAGG CGGTGCGGCT TGCGACATCA 
CGCGCCGAGG ATGCGTTCAC TGCCCTTGCC GACATCAGGC GGCAGCTCGC CACGGCAAGG
CCAAGTTTCA TTTTTCTGTT CGTGCCGAAC AGGCTGCAAA CGGACGACCT CGCCGCGGCG
CTTAGAAAGT CGTTGCCGAA CACGGTCGTC TTCGGCTGCA CGACGGCGGG CCAGATTACG
CCTCGTGGCT ATGAAAACGA CGCCCTGCTG GCGGTTGCCT TCCAACGGCA TCATTTTCGG
GTAGCCTCGA TCCTGTTCCA GCCGATTTCT CCGGTTTCGA TTGCCGATGT CGTGTCGCAG
ACGGAGCGGC TCGCCTCGCA GTTCCGCGTT ACGCCGGGCC GAAAGCGGTT GGCGCTGATC
TTTGCCGACG GACTGTCCAA GCAGGAGGAT GTTCTGGTGG CAGCGCTCGA GGCGGGGCTG
AAGGATATCC CGGTTTTCGG CGGTTCGGCC GGCGACGGGC TGCGGTTCCA GCGCACACAG
GTACTCCGCA ACGGTGAATT TCACAGCAAT GCGGCGCTTC TCCTGCTGCT CGAAACCGAC
CTGGAGTTCA GCGGCCTCGG CTTCGACCAC TTCCAGCCGA CCGACAAGCG CATGGTGGTG
ACCCGCGCCG TTCCGGAGGA ACGGCTGGTT CTGGAAATCA ACGGCTCTCC GGCGGCGGAG
GAATATGCCC GGCTCGTCAA GGTACCGGTC GACGAGTTGT CACCGATGGT CTTTGCGGAG
AACCCGGTGC TGGTGCGCAA CGGCAATCTC TATCACGTAC GGGCCATACA GCAGATACAC
GGCGAGCATG GCCTCACCTT CCTATCGGCG ATAGACGATG GTCTCCTGCT GCGGCTGGGT
CGCGGCAAGG AGATCATCCG CACGCTCGAG ACGGGGCTGG CGGTCAACGG CCATGACGGC
GAAGCGCCCG ACTTCATACT GGGATTCGAT TGCTATCTGC GCAAGCTCGA GATCGAGCGG
AAGGGCCTGG ACAGCGAGGT TTCCCAGCAA TTGCGGCAGA ACCGCGTCGT CGGCTTCAAC
ACCTATGGCG AGCAGCATCT CGGCGTCCAC GTGAATCAGA CCTTCGTCGG CGTCGCGTTT
TTCCGGCCAA AGGAGGGCGC GCCGCTATGA
 
Protein sequence
MTAVDDLERK PAEAVRLATS RAEDAFTALA DIRRQLATAR PSFIFLFVPN RLQTDDLAAA 
LRKSLPNTVV FGCTTAGQIT PRGYENDALL AVAFQRHHFR VASILFQPIS PVSIADVVSQ
TERLASQFRV TPGRKRLALI FADGLSKQED VLVAALEAGL KDIPVFGGSA GDGLRFQRTQ
VLRNGEFHSN AALLLLLETD LEFSGLGFDH FQPTDKRMVV TRAVPEERLV LEINGSPAAE
EYARLVKVPV DELSPMVFAE NPVLVRNGNL YHVRAIQQIH GEHGLTFLSA IDDGLLLRLG
RGKEIIRTLE TGLAVNGHDG EAPDFILGFD CYLRKLEIER KGLDSEVSQQ LRQNRVVGFN
TYGEQHLGVH VNQTFVGVAF FRPKEGAPL