Gene Smed_2200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2200 
Symbol 
ID5323060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2281006 
End bp2282685 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content64% 
IMG OID640791138 
Producthypothetical protein 
Protein accessionYP_001327868 
Protein GI150397401 
COG category[S] Function unknown 
COG ID[COG4425] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0720421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACATG TTCAGCAGAC GGACCTGCCA AGACCGGTGC CCCGTCGCAT CTCCAACTTC 
TTGCGTTCGT TTTCCACAAG CGGCCTCCTC ATAGGCGTTC TCTTCTTTGC CGTTTCGCTG
ACGCCAAGTC TCATACCGCG GCCCTACCTT ATTCAGGCGG TGATCTCGGG CTTCTCTCTC
GCGGCGGGCT ACGCGATCGG CGTGTTCCTG CGCTGGCTCT GGTCCTTCTT CGAGCTTCCC
GAACCGACGG TAAGGCGTGC ACGTACGCTG AAGATCGCCG CCGCCATCGT TTCGATTGCG
GCCGCCGTCG TATTCCTATG GCAGGCTTCG CATTGGCAAA ACACCGTGCG GCACATTATG
GGCCTGGAGC CGATCGAAAG CGCAGAGCCC GCCACTCTCG GCCTCATGGC CATTCTCGTA
TTCGCCGCCC TGGTCCTGCT GGCACGGCTA TTCCGGCTGA CCTTTCGCGT GCTTTCACGA
TGGCTGCAAT ATTTCCTCAC GCGGCCGGTT GCCAACGCGC TCGGCGGGCT CGTGGCGCTC
GCCTTGTTCT GGTCTGCGGC GAACGGTGTG ATCTTCAAGT TCGCGCTTCG CGCTGCGGAC
AGTTCCTTCC AGCAACTGGA TTCGCTCATC GATCCTGAGG TCGCACCGCC TGCGGATCCC
GGCAAGACAG GCAGCGCCGC ATCGCTTGTG CACTGGGACG AGCTTGGACG GCAAGGGCGG
CAGTTCATAG CCTCCGGGCC GACCGGCGCC GAAATCGGGG CGTTCTTCGG CATCGCGGCG
CCGGAACCCG TCCGGGTTTA TGTGGGACTG AACTCTGCCG AAACGGCGCG GGAAAGGGCG
AAGCTTGCGC TCGAGGAGTT GAAACGCGCC GGCGGCTTCG AACGCAAATC TCTGATCGTC
ATCGTGCCGA CCGGCACCGG CTGGATTGAT CCGGAGGCGC TCGACACCCT CGAATATCTG
CTTCACGGAG ATGTCGCGAG CGTGGCCGTA CAGTACTCCT ATCTCACCAG CTGGCTGTCG
CTTCTGGTCG AGCCGAGTTA CGGCGCCGAA GCGGCCGACG CCCTCTTCGA CGAGATCTAC
GGGCACTGGA CGACGCTGCC CAAGGATCGG CGGCCCAAGC TCTATCTCCA CGGTCTGAGC
CTCGGGGCGA TGAATTCGCA GGGGTCGGTC GATCTCTTCG ACGTCATCAG CGATCCCTTT
CAGGGCGCGC TCTGGAGCGG GCCGCCGTTC CAGAGCACCT TGTGGCGTTC GGTGACGGCG
GACCGGGTAC CGGACTCACC TGCCTGGCTG CCGCGCTACC GCGACAGCTC CGCCATCCGC
TTCACCAACC AGGAGAATGC CCTCGATATC CCCGGCGCGC ATTGGGGCGC GATGCGGATC
GTCTACCTGC AATATGCCAG CGACCCGGTG ACGTTCTTCG ATCCCCATTC CTTTTATCGC
GAGCCGGACT GGATGAGGTC GCCGCGAGGG CCGGACGTCT CACCGGCGCT GAGCTGGTTT
CCCTTGGTCA CCGGTCTGCA ACTGCTGGCC GACATGGCGT TGGCGACGAC CTCTCCGATG
GGCTACGGTC ACGTCTACGC CCCGGAACAC TACATTGACG CCTGGATGGC GGTCACCGAT
CCGCCGGGGA TTACGGCGGC GGATGTGGCG CGGCTGAAAG CGCAATTCTC CGCGCGTTGA
 
Protein sequence
MEHVQQTDLP RPVPRRISNF LRSFSTSGLL IGVLFFAVSL TPSLIPRPYL IQAVISGFSL 
AAGYAIGVFL RWLWSFFELP EPTVRRARTL KIAAAIVSIA AAVVFLWQAS HWQNTVRHIM
GLEPIESAEP ATLGLMAILV FAALVLLARL FRLTFRVLSR WLQYFLTRPV ANALGGLVAL
ALFWSAANGV IFKFALRAAD SSFQQLDSLI DPEVAPPADP GKTGSAASLV HWDELGRQGR
QFIASGPTGA EIGAFFGIAA PEPVRVYVGL NSAETARERA KLALEELKRA GGFERKSLIV
IVPTGTGWID PEALDTLEYL LHGDVASVAV QYSYLTSWLS LLVEPSYGAE AADALFDEIY
GHWTTLPKDR RPKLYLHGLS LGAMNSQGSV DLFDVISDPF QGALWSGPPF QSTLWRSVTA
DRVPDSPAWL PRYRDSSAIR FTNQENALDI PGAHWGAMRI VYLQYASDPV TFFDPHSFYR
EPDWMRSPRG PDVSPALSWF PLVTGLQLLA DMALATTSPM GYGHVYAPEH YIDAWMAVTD
PPGITAADVA RLKAQFSAR