Gene Smed_0465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0465 
Symbol 
ID5321299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp501166 
End bp503100 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content62% 
IMG OID640789400 
Productputative transmembrane signal peptide protein 
Protein accessionYP_001326157 
Protein GI150395690 
COG category[S] Function unknown 
COG ID[COG4907] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.559326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.552522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCGGC TTTTAGCGGC GCTTGCGCTC GTCTTTTTCG TTCTTGCAAT GCCGCTAGAC 
GCGGCGGCGG AAGAGTTTAT CTCCGCCTAC CATTCGGTGA TCGGCGTCGC AAAGGACGGC
ACGCTGACGG TAACCGAGAC GATTACGGCC AATGTGGAGG GAAACCAGAT AAGGCGCGGC
ATCTACCGCG ACTTTCCGTT GACCTTCGTC GATGAGCGCG ACCGCCGCAG CAAGGTCGAC
TTCAAACTCC TCTCCGTCGA GCGGGATGGC GATGATGAGG ATTACCGGAC CGAATCGATC
AACGGCGGCA TCCGCATCTA CACCGGGAAC GCCGACGTTC TGCTGCCGCA TGGCGAGCAC
ACCTTCCAGA TCACCTATGA GACGAGCCGG CAGATACGCT TCTTCGATGA TCACGACGAA
CTCTACTGGA ATGTGACGGG AACCGAATGG GCATTTCCGA TCGAGGAGGC CACGGCCACC
GTGACACTGC CTGATGGAGT GAAGGCGAAG GCGCTCGACG TCTTTACCGG CGGCTACGGG
GCGACGGAAA AAGATGCGCG GGCGGTGGAG GAGGGTGACG AAATCTTCTT CGCGACGACG
CGCCGGCTGC GCCCGCAGGA AGGATTGACC GTCGCGATCA AGCTGCCCAA GGGCAGCATC
GAGCGCCCCA CTCCTTCGCA GGAAAATATC TGGTGGCTGC GCGACCACGC GGCCCTGGTC
ATCGCCGGAG CCGGCCTCCT CTTCGTGACG CTTTATTACG GGCGCGCCTG GATTCGTGTC
GGCCGCGACC CGACGCGCGG GGTCATGGTC CCGCGCTGGG ATCCTCCGGA GGGTGTCTCG
CCCGCGCTGG TCAACTACAT CGACAACAAG GGTTTTTCCG GCGGGGGATG GACGGCTCTC
TCAGCTGCGG CGCTCAACCT TGCGGTGCGG GGACATGTTG TCCTGGAAGA CCTGAAGAAT
GCGATCATCA TCACCGCCAC GGGCAAGACC GGTGAAAAGC TGCCGACCGG CGAAGCGGCC
TTGATGAGGG CGGTCGAAGC CGCTGACGGC AAGCTCACCA TCGACCGCGA GAATGGGAAG
AGGATTCAGG CCGCCGGTTC CGGCTTTCGC AGCGCGATGG AGCGCGAGCA TCGTGGAAAG
TATTACCGCG CCAATAAGAG CTACGTCGTC GTCGGCATCG TTCTTTCGGC CGCCACCCTC
GCGGCGCTGC TCATCTTCGG CGGCTTGAGC GAGGACAGCA TTCCTTTCGT GATCGTCCCG
GTTTTTCTCG CCGTCTTTAT TGCCGCCTTC GCCGTGTCGG TCGGCAAATC GTTCCGGCGC
AGCTCGAGCC TCAGACGCCG AATCCTTTCG ATCGTGGTCC TGGCTTTCAT GGGCTTCGTG
CTCTTCACCG AATTTTCGAG CATTCTCGCC GCGCTCGTCT TTTCAGCCAG CGACCCAGCC
GACCTGCCGT TGTTCTTCGC AATCGGCGGC ATCGTCCTCG TGAACGGGCT GTTCTATTTT
CTCATGGGCG CGCCCACACC GCTGGGTACG CGCATGATGG ATGGTATCGA CGGTCTCAGG
CAATACCTGA CGCTCGCCGA AAAGGACCGG CTGAACATGC AAAGCGCGCC GGAAATGTCG
CCCCGGCATT TCGAAACCCT GCTTCCTTAT GCAGTGGCTC TCGGAGTGGA GAAGCCCTGG
AGCGAGACCT TCGAGCGCTG GCTGCTTGCA GCTTCCGCCG GCGCGGCTGC GGCCGCCTAC
CAGCCGAGCT GGTATCATGG CGATTCCTTC GGCCCCGGAT CCTTCACCGA CACGATCGGC
GGTTTGGCCG GTTCGATGAC GGATAAGATC ACGTCTTCCT TGCCGCCGCC GGCCAGGAGT
TCGTCCTCCG GCTTTTCCTC CGGCGGCGGG TTTTCCGGCG GCGGCGGAGG AGGTGGCGGC
GGCGGCGGCT GGTGA
 
Protein sequence
MRRLLAALAL VFFVLAMPLD AAAEEFISAY HSVIGVAKDG TLTVTETITA NVEGNQIRRG 
IYRDFPLTFV DERDRRSKVD FKLLSVERDG DDEDYRTESI NGGIRIYTGN ADVLLPHGEH
TFQITYETSR QIRFFDDHDE LYWNVTGTEW AFPIEEATAT VTLPDGVKAK ALDVFTGGYG
ATEKDARAVE EGDEIFFATT RRLRPQEGLT VAIKLPKGSI ERPTPSQENI WWLRDHAALV
IAGAGLLFVT LYYGRAWIRV GRDPTRGVMV PRWDPPEGVS PALVNYIDNK GFSGGGWTAL
SAAALNLAVR GHVVLEDLKN AIIITATGKT GEKLPTGEAA LMRAVEAADG KLTIDRENGK
RIQAAGSGFR SAMEREHRGK YYRANKSYVV VGIVLSAATL AALLIFGGLS EDSIPFVIVP
VFLAVFIAAF AVSVGKSFRR SSSLRRRILS IVVLAFMGFV LFTEFSSILA ALVFSASDPA
DLPLFFAIGG IVLVNGLFYF LMGAPTPLGT RMMDGIDGLR QYLTLAEKDR LNMQSAPEMS
PRHFETLLPY AVALGVEKPW SETFERWLLA ASAGAAAAAY QPSWYHGDSF GPGSFTDTIG
GLAGSMTDKI TSSLPPPARS SSSGFSSGGG FSGGGGGGGG GGGW