Gene Smed_6107 details


Gene Information

Locus tag: Smed_6107
Symbol:
ID: 5320409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 1039381
End bp: 1040751
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 57%
IMG OID: 640777747
Product: TfuA domain-containing protein
Protein accession: YP_001314679
Protein GI: 150378084
COG category: [S] Function unknown
COG ID: [COG3482] Uncharacterized conserved protein
TIGRFAM ID:
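
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; the record does not state either convention, but both are consistent with the reported lengths. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1039381
end_bp = 1040751

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per amino acid, plus one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1371 0 456
```

Under these assumptions the result agrees with the reported gene length of 1371 bp and protein length of 456 aa.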


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.620154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGTCG AGACCAACAA TGAGATTGCC GTTTTTCTGG GCCCGTCCTG CTCGATTGAA 
GAAGCTAAAA GCATCCTGCC GGAGGCAGAT TACTTTCCGC CCGCGGCTCG TGGCTCCATC
TACGGTATTA TCAATGATGG ATACCGGATG ATCGTCCTTC TCGACGGTCT GTTCTACGGA
CAATATTCTG TTTGGCATAA GGAACTCCTG TTCGCCCTGG ATTGCGGCAT TGAAGTAATC
GGCGCAACGA GCATGGGTGC CCTGCGGGCG GCCGAACTGG ACTGCGAGGG CGTGACCGGT
GTCGGGCAGA TCTACCAGTG GTTTCGTGAC GGCGAAATCG ATGGCGATGA CGAAGTGGCT
CTGCTCCACC AGAGTTCCGA GGGAGCTTAC GCGCCTCTTT CCATCCCTCT CGCCAATTTG
CGTTGGAATC TGCGCCTTGC CCGAAGAGAA TGTATGATCG ACGAGCAGCA GGAAGCGCGG
ATTCTGGACC ATGCAAAGGC CCTGTGTTTC CAGGACCGAA TGATGGAGGT CGTGCTTTCG
CCGCTGGCAA AGGAACTGGA TGTTTCTGGC TTGCAAAAAT GGCTTGAAAC GCATGGAGAG
GATCTAAAAA AAAGGGATTG CTTAGAAGCG TTGCGCTTTG CCGCCAGCCG GATCGCGACC
CTTGGCCCCG CGCAGCCTCC CCGTCTTCGC GCCTATGAGA CGGTGCATAC GCTGATCGGC
ATCGAGTATT TCACGCACGA GCGGTTGAAT GCCATCTGCG CAAAACGGCA AGGCAAGGCC
ATGCCGCTCT CGCAGTACAC CGAGCGAGTT GCCGTGGGAG ATCCCTCCTA TCGTAGCTAT
CTGCGCGCTC GCGCTTGCCA GCGCCTTATC AATGGCTGGG CTCGGGAACT TCACCTCGAA
ATCACCCCGG CGCCCATTCT GCCCCAATGG TTACCCAATC GGTTTGACCT CGCTCACCGC
AGGGCAACCG GCCTGACGCT AATCGATATT GCACGTGAGG GACGGGAAGC CGCCTTCACC
GCAGGCGTCA TCGGATCTCT CTCATCACCC GCCAGCCGCA GCCTCGTCTC CGATATTGAC
CGGCAACTGG CAGAGGCAGG TTTCTCGGCA TGGGGCGCGG GAGCCGAAAT CACACATTTG
GATGGCCGCC TTGTTTATAC GCTTCGTCGC CTGGGGCGCG AAAAGGGCAT TTTGCCGGAC
GAGAATGAAG ATGCAGATAC CGCCGAAGCG GCGCTCCACT ACCTTGAATG GGTCTATCGC
ACCGGCTTGC AGCACTTCGG CTACACATTT GACGCGGCAA CCGAAATCCT CCTGGCACAC
CAATTTGCCG ATCGACTTGA CGACATTCTC GGATTGAGGG CTGCATCATG A
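
The gene sequence above is displayed in space-separated blocks of ten bases, so it has to be joined before it can be measured. A minimal sketch of that clean-up, together with a GC-fraction calculation, is shown here; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from the record above.
first_line = "ATGGTTGTCG AGACCAACAA TGAGATTGCC GTTTTTCTGG GCCCGTCCTG CTCGATTGAA"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases on a full line
print(f"{gc_fraction(seq):.0%}")   # GC fraction of this line only
```

To compare against the GC content reported in the Gene Information section, pass every line of the gene sequence to `clean_sequence` rather than the first line alone.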
 
Protein sequence
MVVETNNEIA VFLGPSCSIE EAKSILPEAD YFPPAARGSI YGIINDGYRM IVLLDGLFYG 
QYSVWHKELL FALDCGIEVI GATSMGALRA AELDCEGVTG VGQIYQWFRD GEIDGDDEVA
LLHQSSEGAY APLSIPLANL RWNLRLARRE CMIDEQQEAR ILDHAKALCF QDRMMEVVLS
PLAKELDVSG LQKWLETHGE DLKKRDCLEA LRFAASRIAT LGPAQPPRLR AYETVHTLIG
IEYFTHERLN AICAKRQGKA MPLSQYTERV AVGDPSYRSY LRARACQRLI NGWARELHLE
ITPAPILPQW LPNRFDLAHR RATGLTLIDI AREGREAAFT AGVIGSLSSP ASRSLVSDID
RQLAEAGFSA WGAGAEITHL DGRLVYTLRR LGREKGILPD ENEDADTAEA ALHYLEWVYR
TGLQHFGYTF DAATEILLAH QFADRLDDIL GLRAAS
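
The gene and protein sequences can be checked against each other by translating the DNA. The sketch below builds a codon table in the conventional TCAG ordering; the amino acid assignments are those of the standard genetic code, which translation table 11 shares (the two tables differ in their permitted start codons, not in codon meanings). `translate` is a hypothetical helper written for this illustration, and it stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate("ATGGTTGTCG AGACCAACAA TGAGATTGCC"))  # MVVETNNEIA
# Last 21 bases, ending in the TGA stop codon.
print(translate("GGATTGAGGG CTGCATCATG A"))           # GLRAAS
```

Both fragments match the corresponding ends of the listed protein sequence, which begins with MVVETNNEIA and ends with GLRAAS.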