Gene Smed_5971 details

Gene Information

Locus tag: Smed_5971
Symbol:
ID: 5320273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 929726
End bp: 931201
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 57%
IMG OID: 640777652
Product: hypothetical protein
Protein accession: YP_001314584
Protein GI: 150377989
COG category: [S] Function unknown
COG ID: [COG5361] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
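
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 929726
end_bp = 931201
gene_length_bp = 1476
protein_length_aa = 491

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, so it encodes (codons - 1) residues.
codons, remainder = divmod(span_bp, 3)
expected_aa = codons - 1

print(span_bp, remainder, expected_aa)
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.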


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.346043
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATCA CTCGGAGGGA TGCGTTCAAA ATGGCTTCGT CTGCTGCAAT TCTTGGAGCT 
AGCGCGGCGG CAGCACACGA TGCTGCCGCC AAAGCCGTCC CGCAGGGTGT CGATCTGGAG
TTCGACCTCG GAATTCCTAC GCAAGATACG GTCGAGAAGC TTTACGACAC GATGGATTTT
CAACGCGCAG TGCAGGGCTA TCTCTGGGCG GTTCCGATCG TCGGAATGGA AGGTGCGCGC
CGGATGCTTG TCGACAACGC CGAAGCCAGG AGCGGCGATC TTGTGCTTGT TGCCGGGTAT
AGGGACGTCA GTGCCATGCT CGGGTCGAAT GTGACGACGC CCTATGTGTT CGCGTGGTTT
GATCTTACAG AAGGACCGAT TGTCATCGAA TATCCCGAAG GCGCCACGGC CGGCTCCCTG
ATCGACTGGT GGGATCGTCC GCTCATTGAT GTCGGCGTTT CGGGCCCAGA TGGCGGCAAG
GGAGCGAAAT TCGTAGTGGT TGGTCCGGCG CATGAAGCGC CGGAAAATTC CCCGGCTGGC
GCAAAACTGC TGCGTTCCCG CACCAACAAA GTCCTGTTGT TCTGCCGGGG ACTCGATGGT
GACCTCAAGA CGGTCGAGGC TGTTTTCTCT AACACTCAGG TCTATCCGCT TGGCGCTACA
GGGAGTGGAG TGGCCGCGTT TCTTAGATTC AAGACGGAGG GCGAGTTGAC CAGCATGGCT
CATCCTAAGG GCCTCGCATA CTGGCAGTCG TTGATTCAGG CGCTGGATGG TGAGCAGATC
GAGGATCGAG ACCGCTTCTT TGCCGCTATG TTGAAGCCGC TCGGCGTCAC CTATGGCGGA
TCGTTCTCGC CAAACGACCG GCAGACGGGG TTACTTCACA ACGCCGCAAT CCTCGGCGAA
GCGATGGCGA AGGCCAGTGC TTTCAGCAAG CGCATTCCAG GGATGCGCTA TAGGGACGAT
ACGCACTGGG AATATTTGAT CCCTCAGGAC TTTGTCAACG AACAGGACGG ACCGGATGGT
ACTCTGCTCG ATCAACGGAC GGCCTTCTTC TACGAGGTCA CAGGCACTTC CGCCGCCGTT
CTCACCAAGA CACCCGGAAC TGGCTCGGCA TACCTCACCG CCTACAGTGA TCCTGACGGA
CACGCTTTCG ACGGCGCAAA GTCTTACCGG TTGCGCGTTC CAGCCAATGT ACCCGCCAAG
ACCTTTTGGT CGATCACGCT CTACGACACC GAGACGCGCG GTCTCATTCA GAACAAGCAA
CAGATCGTGG ATCGGTCCTC ACGGCAAAAT CTCAAGGTCC AAAACGACGG CTCGATTGAG
ATCGTTATGG GACCGCAGAC TCCGGATGGC CTGGAGCAGA ACTGGATACC GACAACGCCA
GGTAAGGCTT GGTTTGTGTA TTTCCGCTTG TTCGGTCCGC TAGAGCCATA TTTCGACAAG
TCGTGGCGCT TGCCTGACAT TGAGAAGGCC ATATAA
 
Protein sequence
MEITRRDAFK MASSAAILGA SAAAAHDAAA KAVPQGVDLE FDLGIPTQDT VEKLYDTMDF 
QRAVQGYLWA VPIVGMEGAR RMLVDNAEAR SGDLVLVAGY RDVSAMLGSN VTTPYVFAWF
DLTEGPIVIE YPEGATAGSL IDWWDRPLID VGVSGPDGGK GAKFVVVGPA HEAPENSPAG
AKLLRSRTNK VLLFCRGLDG DLKTVEAVFS NTQVYPLGAT GSGVAAFLRF KTEGELTSMA
HPKGLAYWQS LIQALDGEQI EDRDRFFAAM LKPLGVTYGG SFSPNDRQTG LLHNAAILGE
AMAKASAFSK RIPGMRYRDD THWEYLIPQD FVNEQDGPDG TLLDQRTAFF YEVTGTSAAV
LTKTPGTGSA YLTAYSDPDG HAFDGAKSYR LRVPANVPAK TFWSITLYDT ETRGLIQNKQ
QIVDRSSRQN LKVQNDGSIE IVMGPQTPDG LEQNWIPTTP GKAWFVYFRL FGPLEPYFDK
SWRLPDIEKA I
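
The relationship between the gene sequence and the protein sequence above can be illustrated by translating the first and last lines of the gene sequence. This is a rough sketch, not the tool used to produce the record: the helper `translate` and the table construction are illustrative, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons).

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with their amino acids.
# Codon-to-amino-acid assignments are identical in the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and final 36 bases of the gene sequence in this record.
first_block = "ATGGAAATCA CTCGGAGGGA TGCGTTCAAA"
last_block = "TCGTGGCGCT TGCCTGACAT TGAGAAGGCC ATATAA"

print(translate(first_block))
print(translate(last_block))
```

The first block translates to the opening ten residues of the protein sequence, and the last block translates to its final eleven residues followed by the stop codon.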