Gene Smed_5186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5186 
Symbol 
ID5319488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp140192 
End bp141946 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content58% 
IMG OID640776964 
Producthypothetical protein 
Protein accessionYP_001313896 
Protein GI150377301 
COG category[S] Function unknown 
COG ID[COG5616] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.510338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.40998 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATCG CTGACCGGGA GATCGTCCGA GCCGGATCGC CGACCGCTCT CCCTACGAAC 
ACGGAAATTC TTGCACAACT CGATCGAATA CGGTCAAGCG CCGAATTCGA CGTGCCGGAC
CGAGCGCGCA AATTCCTGGC CTACATTGTC GGAGAGGCAA TTGCGGGCCG TGCCGACCGG
ATAAAGGCCT ATTCGATCGC CACCGAAGTG TTCGGGCGGG ATTCTTCTTT CGATGCTCAA
ACGGATCCAG TCGTGCGGAT CGAAGCAGGT CGTATCCGAC GCGCCCTCGA GCGATATTAT
TTCGTTGCGG GGAGCAACGA TCCAATCGTG ATAAAAATGC CCAAGGGCGG CTACGCGCCC
GCCTTCGAAA AACGGCTCGG TGCGCTGTAC CCCCTAACTT CCGGACAGGC CGCAGACGTT
CAGTCGCGTT CCATGCCGCT TGAGCAGACC GCGCTGTGGG TCAGTGTGGC CACAGTGGGT
CTCCTGACAT GTGGTTTGAT CGCAAATGCA TTTTTTGGCT CGGCGGCAAC GACATTCGAG
AGCCTCACGA AGTCTGGCGG CACTCCAAAC ATTCCGAAAC TGATGGTGAT GCCGTTTGAA
GACCTTTCAC AAACGCCGCA ATCGGCGATG ATCACACGCG GACTAACAGA TGAGGTCATC
AACAACATCG CCAAGTTCAA GGAGATTGTT GTGGTCGCCG GGCCGGCAGC CCCGAACCCT
CATAGCGCCG AAAGGGAATA TCCGGCCTTT GCGTTAGAAG GCCGAGTCCG GCTCGATGGC
GACAAACTTC GGCTGGGCAT ACGGCTAGTC CAGCATTCCG ATGGTTCAGT CGTGTGGGCG
AACACCTATG ACGAAGTACT ACAGCCGCGT AAGATCATCG AGTTGCAGCA GAACGCCGCT
GCTGCAGTGG CTAGTGCCAT CGCACAACCA TACGGCATTG TTTTCCAGGC CAACGCCACA
CACTTCATGC GCTCTGTCCC GGACGATTGG CAGGCTTATG CGTGCACCCT GGCCTACTAC
GGGTATCGGG GCGATCTAAA TCCGCAGACA CATGCCTCCG TTCAAGAGTG TCTTCAGCAT
GCAACGACAC AGTTTCCCGA TTATGCTACG GCATGGGCAC TTCTCTCACT GACCTATGTT
GACGAGCTTC GCTTTCGATA TCGCCTGAAC CGATCGACGT CGGTGTCGCT CACACACGCG
ATCGAGGCTG CGGCGCGTGC GGTGGAGCTT GACCCTCAGA ACGTTCGCGC ACTGCAGGCG
GAGATGCTTA CATTATTTTT CCGCGGCGAA GTTAGCGCTG CCCTGGCGGT CGGCGCGCGC
GCTTATGCGA TCAATCCCAA CGACGCCGAG TTCTCGGGCG AATATGGTTT CCGGCTCGCA
CTGTCCGGTC AGTGGCGTTC GGGGTGCGAC CTTGTGTCGA AAACCGTAGC GAGCAATCCC
GGGCCGACAG GATATTTCGA GGCAGCGCTG GCGGTCTGTT GCTATATTGA ACACGACTAC
GTCGCCGCCG AACGGTGGGC CCGGTTGGCG GACCTTCACG CCAATCCTGT CTACCATGTC
ATTTTACTCG CCATTCTTGG CAAGCTCGGC AAGATGGACT TGGCTCGTGC CGAGAGACAA
TGGCTTGAAA CCAACGTACC TGGTTTCCTC GAGAACGCAC GAAACGAAGT CGCATTGCGA
ATTCACCGCC CGGAAGACCG AGAGCATTTC ATAGAGGGTT TGCGCCAGGC AGGCGTACCC
GTTCCCGGAA ATTGA
 
Protein sequence
MTIADREIVR AGSPTALPTN TEILAQLDRI RSSAEFDVPD RARKFLAYIV GEAIAGRADR 
IKAYSIATEV FGRDSSFDAQ TDPVVRIEAG RIRRALERYY FVAGSNDPIV IKMPKGGYAP
AFEKRLGALY PLTSGQAADV QSRSMPLEQT ALWVSVATVG LLTCGLIANA FFGSAATTFE
SLTKSGGTPN IPKLMVMPFE DLSQTPQSAM ITRGLTDEVI NNIAKFKEIV VVAGPAAPNP
HSAEREYPAF ALEGRVRLDG DKLRLGIRLV QHSDGSVVWA NTYDEVLQPR KIIELQQNAA
AAVASAIAQP YGIVFQANAT HFMRSVPDDW QAYACTLAYY GYRGDLNPQT HASVQECLQH
ATTQFPDYAT AWALLSLTYV DELRFRYRLN RSTSVSLTHA IEAAARAVEL DPQNVRALQA
EMLTLFFRGE VSAALAVGAR AYAINPNDAE FSGEYGFRLA LSGQWRSGCD LVSKTVASNP
GPTGYFEAAL AVCCYIEHDY VAAERWARLA DLHANPVYHV ILLAILGKLG KMDLARAERQ
WLETNVPGFL ENARNEVALR IHRPEDREHF IEGLRQAGVP VPGN