Gene Smed_5041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5041 
Symbol 
ID5319090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1561872 
End bp1563110 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content60% 
IMG OID640776822 
Producthypothetical protein 
Protein accessionYP_001313754 
Protein GI150377158 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGACA GGTTCACATT GGCCGTAACC GGCCAATCGC TCATCAAGCA CGATATCCGC 
GACATTCCTG CTCCTGCCTT CCGTGAGGTT CAGTCCCTCC TTCGCCAGGC GGATCTTTCT
TTCACCAACT TCGAGGGAAC GATCCTCGGT ACGCACGGGG GTTGGCCGCT CAAAGGTTCG
TTCTTCGGGT GCAGCGACCC GGCCGTTCTC GATGCACTTG GCGCCATCGG CTTTCAGGCA
TTGTCCCTTT CGAACAATCA TGCCTTCGAC CTCGGACCAT CCGGGGTGCT TTCGACGCTG
GAGGAGGTGG AGAAACGAGG CTTTTTCCAT GCCGGTCTCG GCCGCAACGC GCGAGAAGTC
TCGCGTGCAA GCATCGCCAC GATCAACCAA CGCCGTATTG CCCTCGTTGC GATGGACGGG
GGCCCCGGAC CCGATTTCAT GTATGCCGCG GACGCGGACG AAAATCGCCC CGAACGCCCC
GGTGTGAACC GGCTTCGTCT TTCACAGCTC CTCGAGGTCG ACGATCTCGC GTTTGAGCAG
ATCCGGGCGG TTCGCGACAA GATCGGCTAC ACTGCGATCG ACCTCGCCAA TGACAGCCAG
CCGGACGATC CGCCGCGCCT CGACCCGCAG GCTGAGGTCG CTATCGCCCG CTCTGTCTTC
AAACGGTCCG ACCGGTACGG ACGCGGTGTA AGGATAGATG AGGTCGACAT GGCAAGAAAC
CTTGCCGCAA TCGCCTCTGC AGCCAGGGAC AGCGCACTGG TGATCGCCTA TCTCCATCAT
CATCACTGGG CTTCCGACTG GTATCAGCTG CCCGAATGGG TGAGCGGTGT GGCCAAACGA
TGCATCGACG CAGGCGCGTC CATGTTTGTC AGTCACGGCG CGCCGGTGCT GCAACCGGTC
GAAATCTATC GAGGCCGGCC AATCTTCTAC AGCCTGGGCA ATTTCATCTT CCATGTCCAA
TCGGAGAAGT CGACCTGGAC CGCAGCGGAA GTCTGGGAAA GTGTCGTCGG GGTTTGCTCC
TTTGCCAGCG ACAACAGCCT CATCAACATC AGCTTCCATC CCGTCGTCAT CGGCGGCGAG
CATGGATTGG AGGACGGGGT GTTGGAACGT CGGCTGGTTC CACAGCTTGT AACCGGAGAC
AGCGCGGTCA GGATCCTTGG CCGGCTTCAG GAGCAATCGG CGCGACTGGG CGCGCATATA
GAAATCTCCG GCAATGTCGG CAGGCTGCAA GCGCGGTAG
 
Protein sequence
MNDRFTLAVT GQSLIKHDIR DIPAPAFREV QSLLRQADLS FTNFEGTILG THGGWPLKGS 
FFGCSDPAVL DALGAIGFQA LSLSNNHAFD LGPSGVLSTL EEVEKRGFFH AGLGRNAREV
SRASIATINQ RRIALVAMDG GPGPDFMYAA DADENRPERP GVNRLRLSQL LEVDDLAFEQ
IRAVRDKIGY TAIDLANDSQ PDDPPRLDPQ AEVAIARSVF KRSDRYGRGV RIDEVDMARN
LAAIASAARD SALVIAYLHH HHWASDWYQL PEWVSGVAKR CIDAGASMFV SHGAPVLQPV
EIYRGRPIFY SLGNFIFHVQ SEKSTWTAAE VWESVVGVCS FASDNSLINI SFHPVVIGGE
HGLEDGVLER RLVPQLVTGD SAVRILGRLQ EQSARLGAHI EISGNVGRLQ AR