Gene Smed_4624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4624 
Symbol 
ID5318935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1126767 
End bp1128005 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content61% 
IMG OID640776423 
Producthypothetical protein 
Protein accessionYP_001313355 
Protein GI150376759 
COG category[S] Function unknown 
COG ID[COG3748] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.449143 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGAGT ATGCCATCGC CTGGGATTGG CTGACATTTG CGGTGAGATG GCTGCATGTC 
ATCACCGGCA TCGCCTGGAT AGGCTCATCC TTTTACTTCG TTGCGCTCGA CCTGGGCCTG
AAGCAGCGTC CGGGCCTGCC GGTCGGCGCC TATGGCGAAG AGTGGCAGGT GCATGGCGGC
GGTTTCTACC ACATCCAGAA ATATCTGGTG GCGCCGGCAA ACATGCCGGA GCACCTGATC
TGGTTCAAAT GGGAATCCTA CGTCACCTGG CTCTCCGGTT TCGGCATGCT TGCGCTCGTC
TATTATGCCG GCGCGGACCT CTACCTCATC GATCCGAACG TGCTCGACGT TTCGAAGCCG
ATGGCGATCG CCATCTCGCT CGCCTCGCTC GGCTTCGGCT GGCTCGCCTA CGACATGATC
TGCCGATCCC CATTCGGCAA TGACAATACG CGGCTGATGG TGCTGCTCTA TTTCATTCTC
GTCGCCGTCG CCTGGGGTTA CACCCAGCTG TTCACGGGGC GTGCCGCCTA TCTGCATCTA
GGCGCCTTCA CGGCGACCAT CATGTCGGCG AACGTGTTCT TCATCATCAT CCCGAACCAG
AAGAAGGTCG TTGCCGACCT GATCGCAGGC CGCACGCCCG ATCCTGCTCT CGGAAAGCAG
GCGAAGCAGC GTTCGACGCA CAACAACTAT CTGACGCTGC CCGTGCTGTT CCTGATGCTG
TCGAACCATT ACCCGCTCGC CTTCGGCACG CAGTATAACT GGATCATCGC CTCGCTGGTT
TTCCTCATGG GGGTCACGAT TCGCCACTGG TTTAACACCC GCCACGCCAA CAAAGGCAGC
CCGACCTGGA CCTGGCTCGC GACCGTGCTC CTGTTCATCG CCATCATGTG GCTTTCCACC
GTGCCCAAGG TCCTCTCCGA GGGCGGAGAG GCAAGGGCAG CGACGGCGGC CGAGGCGGTG
GTCGCGTCTC CGGATTTCTC CAAAGTGCGC GACACCGTGC TTGGTCGCTG TTCGATGTGC
CATGCGCGGG AGCCCGGCTG GGAGGGTATC ATCGTGCCGC CAAAGGGCGT GATCCTCGAA
TCCGACGGCG ACATTGTTGC GCATGCCCGT GAAATCTACC TGCAGGCGGG GCGTTCGCAT
GCCATGCCGC CTGCCAATGT CACCGGCATC ACCGAGGAGG AACGCCAGCT CATCGCCTCC
TGGTACGAGA GGACGATAAA GGAAGGAAAA GTCCAATGA
 
Protein sequence
MYEYAIAWDW LTFAVRWLHV ITGIAWIGSS FYFVALDLGL KQRPGLPVGA YGEEWQVHGG 
GFYHIQKYLV APANMPEHLI WFKWESYVTW LSGFGMLALV YYAGADLYLI DPNVLDVSKP
MAIAISLASL GFGWLAYDMI CRSPFGNDNT RLMVLLYFIL VAVAWGYTQL FTGRAAYLHL
GAFTATIMSA NVFFIIIPNQ KKVVADLIAG RTPDPALGKQ AKQRSTHNNY LTLPVLFLML
SNHYPLAFGT QYNWIIASLV FLMGVTIRHW FNTRHANKGS PTWTWLATVL LFIAIMWLST
VPKVLSEGGE ARAATAAEAV VASPDFSKVR DTVLGRCSMC HAREPGWEGI IVPPKGVILE
SDGDIVAHAR EIYLQAGRSH AMPPANVTGI TEEERQLIAS WYERTIKEGK VQ