Gene Smed_1145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1145 
Symbol 
ID5321991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1214802 
End bp1217201 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content65% 
IMG OID640790086 
ProductComEC/Rec2-related protein 
Protein accessionYP_001326831 
Protein GI150396364 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAGA GTCAGGTACG GATTGTCTTG CGCGACGACG AGCACAGCTC TCCGCCCCCC 
GCCGAAGGCA CCGGGGTCGT CGCCGGCAAA TTGGCGATCG CCTCCGCCGG CCGAGCGCCT
GCCGACCGGC GTTCTGCAAC CATTCGCAAC AGGACTCGTT CCACCGCCGC CGCTGTCTCC
CGCGGGATAA GTCGTGCGGC CGCCGAAGAG CAGGAATACG GCCACGGCTT CGTCCTGGTT
CCGGTCGTGC TCGCGCTCGG ATCCCTGGCA TGGTTTGCGC TGCCCGAAAC CGTCGGAACC
GCTAAACTTG CAGCGCTGCT TTGCGTTTTC GGAATATCTG CGCTGCTTTG CCGAGGGAAC
CTCCGCCCCT GGCGGCCTCT GGCGGTCGCG CCGGCACTGT TTGCTGCCGG CATGCTGCTC
GCCGCGGCCG AAACGGCCAG GCGCGACACC GTCATTCTCG ACGCGCCCGT GACAACGACT
GTTCGCGGGA CGGTTTTGTC ACGGGATCCG GACGACAGGG GACGCTGGCG CTATCTCGTG
GAGATACAGG AAACATCGAA CCCGCGGCTG CGCAGGGCAC CGGAAAGGAC AACGCTTCTG
GCGCGGAGCC GCCACGAGCC TTTTGCCATC GGGGCGATCA TCGAAGGCAA GGCCCGTTTG
TCGCCGCCTT CGGGGCCTGC TCTGCCCGGT CTCAACGACT TCGCTTTCGG CGCCTATGTT
AAAGGCTTGG GCGCGGTGGG GTTTTTCTAT GGTGCGCCGC GGGCGTTGGC GGGCGCCGCG
TCAGGCAGAG ACATAGCAGA AACTTCGCTC GGGGAACGGG CGGCTGCATA TCTGGCACCG
GTCCGCGAGG CGATCGGCAA CCGCATACGC TCGGCGATCG GTGGCGACAC GGGTGCGATC
GCTGCCGCGC TCGTGACCGG CGAGGAGCGC GCGATCAGCC GAGAGGCGGT CGAGACATTG
CGTGCGGCCG GCCTCTCGCA TGTCCTTGCG ATCTCCGGGC TTAACATGGT GCTCGCCGCG
GGTACCTTTC TGATTGGCGC CCGGACGCTG CTGAGCCTCA TTCCCGGCCT GGCCGAAAGG
CACTCCGCCA AGAAGATTGC CGCCGTCGGC GCTCTTCTCA TGGTCTTCTT CTATATCCTG
ATTTCCGGTG GCGCCGTCTC GGCAGTCAGA TCCTGGATAA TGATTTCGAT CATGCTCGTC
GCAGTGCTGT TCGACCGCGT TTCGATCAGC CTGCGCAACG TCGCGCTCGC CGCATTGATC
ATTCTCGCCT GGACGCCGTC GGCGGCAGCC GGACCCGGAT TTCAAATGTC CTTTGCGGCG
ACGCTTGCCC TTGTGGCGGG CTATGCCCGC TGGCGCGACC AGAGGAGAAA GCATCGAGAG
AGCTCGGGCA GCCGGCCGGG CATGGGCGCC GTATCCAGCC TCGCTGTAGG CACCGTTGCC
ACTTCCCTCA TTGGCGGAAT GGCGACCGCT GTCTACGCGG CTGCCCATTT CAACCGGCTT
CCCGCCTACG GGCTGGTCGC AAATGTTCTC ACGACCCCTC TGACCAGCGT ACTCATTATG
CCTTTCGCGC TGTTCGCGAT GCTGCTCATG CCATTTGGGC TCGAACATTA TCCGCTCGTG
ATCATGGGGC AGGGGCTGGA TTGGATGATG GCGGTCGCGC GATATGTCGC CGCGCTCGAT
GGCGAGTGGA CGACCGGGCG CATGGCCGAT GGGCCCTTCT TTCTCATCGC CCTCGGCGGC
ACGCTGCTTT GCGTGTTGAG AACGCGATTG GCTCTTGGAG GCGCTGGGCT CGTCGCGTTG
GGTGTATGCG TGATTGCTCT CGATCCGCGG GAGGAGCGCC CGTCGATCGT AATATCCGAG
GACGCTCAAC TGGTCGGCCT CGTCACCGCT GATGCGATCG CGACCAATCG GAGCCGTCCG
CCGGAGTTCA TCTTCTCGCA ATGGCGGCGG GCGCTTGCCG TCTCAGAGCA TGCAGTGCCG
CTCGATCTCC CTGTGTACGC GGGAAGCGAC GTCGACACGG CAGCTTTGCT GGCTTCCGCC
AAAGTGGGCG TTTTTACGTG CCGGAAAGGC ACCGGGTGCG TCGGTCGTAG CCGGGAAGGT
TGGACAGTAG CGATTGTCGA AAAGGCCGAA TTGGTGCCCC TCTTGTGCGG TCACGCCGAT
TTGGTCGTCG TTGCCAGCCG CCGGCCTGTG GCAGCCTGCC CGCCCGGCGC ATCGCTGATC
ATAAGCACCA GGACGTTGCG ACGCACGGGG TCCGTTGAAA TTCATGCCGT GCCGGAGCAG
GCGGGATCAC CGCGCATGCA GGTCGTCGGC TCGTTCTCCT CCACGGAAAG ACCCTGGCAG
CGTCATCGCC GCTATGACTG GCGGGCCGGC AGTTTCGTGC CTGAGGGGTC ACCGCTTTGA
 
Protein sequence
MAESQVRIVL RDDEHSSPPP AEGTGVVAGK LAIASAGRAP ADRRSATIRN RTRSTAAAVS 
RGISRAAAEE QEYGHGFVLV PVVLALGSLA WFALPETVGT AKLAALLCVF GISALLCRGN
LRPWRPLAVA PALFAAGMLL AAAETARRDT VILDAPVTTT VRGTVLSRDP DDRGRWRYLV
EIQETSNPRL RRAPERTTLL ARSRHEPFAI GAIIEGKARL SPPSGPALPG LNDFAFGAYV
KGLGAVGFFY GAPRALAGAA SGRDIAETSL GERAAAYLAP VREAIGNRIR SAIGGDTGAI
AAALVTGEER AISREAVETL RAAGLSHVLA ISGLNMVLAA GTFLIGARTL LSLIPGLAER
HSAKKIAAVG ALLMVFFYIL ISGGAVSAVR SWIMISIMLV AVLFDRVSIS LRNVALAALI
ILAWTPSAAA GPGFQMSFAA TLALVAGYAR WRDQRRKHRE SSGSRPGMGA VSSLAVGTVA
TSLIGGMATA VYAAAHFNRL PAYGLVANVL TTPLTSVLIM PFALFAMLLM PFGLEHYPLV
IMGQGLDWMM AVARYVAALD GEWTTGRMAD GPFFLIALGG TLLCVLRTRL ALGGAGLVAL
GVCVIALDPR EERPSIVISE DAQLVGLVTA DAIATNRSRP PEFIFSQWRR ALAVSEHAVP
LDLPVYAGSD VDTAALLASA KVGVFTCRKG TGCVGRSREG WTVAIVEKAE LVPLLCGHAD
LVVVASRRPV AACPPGASLI ISTRTLRRTG SVEIHAVPEQ AGSPRMQVVG SFSSTERPWQ
RHRRYDWRAG SFVPEGSPL