Gene Smed_3002 details


Gene Information

Locus tag: Smed_3002
Symbol:
ID: 5323879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3152611
End bp: 3153867
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 63%
IMG OID: 640791953
Product: hypothetical protein
Protein accession: YP_001328666
Protein GI: 150398199
COG category: [S] Function unknown
COG ID: [COG2966] Uncharacterized conserved protein
        [COG3610] Uncharacterized conserved protein
TIGRFAM ID:
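
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 3152611
END_BP = 3153867
GENE_LENGTH_BP = 1257
PROTEIN_LENGTH_AA = 418


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1257
print(coding_codons(GENE_LENGTH_BP))    # 418
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.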


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.682044
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGCAGT CCGATAATAA AAGGCCCGAA CCCGGCGTGC CGGACGACTC GATCGACATG 
CTGCTGCGGT TTGCCGCTAT GATGTTTCGT TCCGGGGGCA CCGCCACCCG TACCCGCGAA
CTTGTGGGGG CGATGGCGCA CCGCCTCAAC CTGGAAACGC CATCGGTGAG TCTCACGCTC
GAATCCGTCA CCCTCAGTGT TTATCGCGGC AGCGAACAGT TCGTAGCGAT ACGCGAGATC
GGTCCGCCCG GAATCGACTT CAGGCGCCTC GGTGAGTTGG AGCGGCTTGC GAACGCGCCA
GAGGCAACGG CACCGCCACA CCGGATCGCG ACAGAACTTG CTGAAATCGA ATCGCGCGCG
CCTCGATATT CCGGGTGGCA GATGGCGATC GCGATCGGCC TGGCCAGCGG AGGGTTTGCG
TTCCTCAACG GAGCGGCCCT GCCGGAAATG GCCACCGCCG CGATCGGTGG CGGCACAGGT
CAAGGATTGC GTTGGTGGTT GACCCGCCGC CAATTGACCG ACTTCGGCAC GGCGGCATTG
GCTGCTGTAA CCGCTGCCGG AACTTACGTT CTGGTGGCGG CGCTGGCGCA CCGTGCAGGG
ATTGCGTTCT CGCACTATGC CGCGGGCTTC ATCTCCACCA TACTATTTCT CATCCCGGGC
GTTTCGCTGA TTGCAGGATT GTTCGACCTG TTGCAGCATC AGACGGTGGC TGCCTTAAGC
CGGCTGGCAC ATGGCGCGTT GATCCTGTTC ATCGTCGCTT CGGGGCTGAG CATCGTGATG
ACTGTTGCGA GCATCGAGCT GTTGCCGCGC TCTGCGCCGG CCGAGCTTGC CTATCCGTTG
CGCCTTTCGC TTCGCGCCGT CGCGAGCTTC GTCGCTGGCT GCGGCTTCGC CATGCTGTTC
AACAGCGCGC CATTTTTGGT GGTCGTCGCG GGCATCGTGG CGCTGGCGGC GAATAGCGTG
CGCCTCGTCC TGATCGACAT GGGAATGCTG CTGGCGCCGG CGGCGTTCAT CGCCGCGTTT
TCGATAGGAA TCATCGCCGT TCTTGCGAGC CGGCGGTTGG ACGCAGAGCT CATGGCCATT
GTTACCCCGC CAGTCGTCAT CATGATTCCG GGTCTCTACG CATTCGAGAT GCTTGTTCTG
TTCAACCGGG GGCAGATGCT CGAGGCCATG CAGGCCTCGG GGGCAGGCAT CTTCGTGATC
AGCGCGCTGG CGATGGGGTT GAGCGTGGCG CGCCTTGCAG TCCCGTGGGA ACGATAG
 
Protein sequence
MTQSDNKRPE PGVPDDSIDM LLRFAAMMFR SGGTATRTRE LVGAMAHRLN LETPSVSLTL 
ESVTLSVYRG SEQFVAIREI GPPGIDFRRL GELERLANAP EATAPPHRIA TELAEIESRA
PRYSGWQMAI AIGLASGGFA FLNGAALPEM ATAAIGGGTG QGLRWWLTRR QLTDFGTAAL
AAVTAAGTYV LVAALAHRAG IAFSHYAAGF ISTILFLIPG VSLIAGLFDL LQHQTVAALS
RLAHGALILF IVASGLSIVM TVASIELLPR SAPAELAYPL RLSLRAVASF VAGCGFAMLF
NSAPFLVVVA GIVALAANSV RLVLIDMGML LAPAAFIAAF SIGIIAVLAS RRLDAELMAI
VTPPVVIMIP GLYAFEMLVL FNRGQMLEAM QASGAGIFVI SALAMGLSVA RLAVPWER
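
The gene and protein sequences above are related through translation table 11, in which the codon assignments match the standard genetic code and alternative start codons such as GTG are read as methionine at the first position. The sketch below is one way to check that relationship and to recompute the GC content from the blocked sequence text. The function names and the `START_CODONS` set (limited here to three common bacterial start codons) are assumptions made for illustration, not part of the record.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(sequence):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(sequence.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence listed above.
print(translate("GTGACGCAGT CCGATAATAA"))  # MTQSDN
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would test the whole record in the same way, and `gc_percent` can be compared against the GC content field.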