Gene Smed_5412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5412 
Symbol 
ID5319714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp375289 
End bp376527 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content61% 
IMG OID640777178 
Producthypothetical protein 
Protein accessionYP_001314110 
Protein GI150377515 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.264785 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGACA GGTTCACATT GGCCGTAACC GGCCAATCGC TCATCAAGCA CGATATCCGC 
GACATTCCTG CTCCTGCCTT CCGTGAGATT CAGTCCCTCC TTCGCCAGGC GGATCTCTCA
TTTACCAACT TCGAGGGAAC GATCCTTGGG CGTCACGGCG GGTGGCCGCT CAAAGGTTCG
TTCTTCGGGT GCAGCGACCC GGCCGTTCTC GATGCACTTG GCGCCATCGG CTTTCAGGCA
CTGTCCCTTT CGAACAATCA TGCCTTCGAC CTCGGACCTT CAGGTGTGCT TTCGACGCTC
GAGGAGGTGG AGAAACGAGG CTTTCTCCAT GCCGGTCTCG GCCGCAACGC GCGAGAGGTC
TCGCGTGCGA GCATTGCCAC GATCAACCAA CGGCGTATTG CCCTCGTTGC GATGGACGGT
GGCCCCGGAC CCGATTTCAT GTATGCCGCG GACGCGGACG AAAATCGCCC CGAACGCCCC
GGTGTGAACC GGCTTCGTCT TTCACAGCTC CTCGAGGTCG ACGATGTCGC GTTTGAGCAG
ATCCGGGCGG TTCGCGACAA GATCGGCTAC ACTGCCATCG ACCTCGCCAA TGACAGCCAG
CCGGACGATC CCCCGCGCCT CGACCCGCAG GCTGAGGTCG CTATCGCCCG CTGTGTCTTC
AAACGGTCCG ACCGGTACGG ACGCGGTGTA AGGATAGATG AGGTCGACAT GGCAAGAAAC
CTTGCCGCGA TCGCCTCTGC AGCCAGGGAC GAGGCACTGG TGATCGCCTA TCTCCATCAT
CATCACTGGG CCTCCGACTG GTATCAGCTG CCCGAATGGG TGAGCGGTGT GGCCAAACGA
TGCATCGACG CAGGCGCGTC CATGTTTGTC AGTCACGGCG CGCCGGTGCT GCAACCGGTC
GAAATCTATC GAGGCCGGCC AATCTTCTAC AGCCTGGGTA ATTTCATCTT CCATGTCCGA
TCGGAGAAGT CGACCTGGAC CGCAGCGGAA GTCTGGGAAA GTGTCGTCGG GGTTTGCTCC
TTTGCCAGCG ACAACAGCCT CATCGACATC AGCTTCCATC CCGTCGTCAT CGGGGGCGAC
GATGGATTGG AGGACGGGGT GTTGGAACGT CGGCTGGTTC CACAGCTTGT AACCGGAGAC
AGCGCGGTCA GGATCCTTGG CCGGCTTCAG GAGCAATCTG CGCGACTGGG CGCGCATATA
GAAATCTCCG GCAACGTCGG CAGGCTGCAA GCGCGATAG
 
Protein sequence
MNDRFTLAVT GQSLIKHDIR DIPAPAFREI QSLLRQADLS FTNFEGTILG RHGGWPLKGS 
FFGCSDPAVL DALGAIGFQA LSLSNNHAFD LGPSGVLSTL EEVEKRGFLH AGLGRNAREV
SRASIATINQ RRIALVAMDG GPGPDFMYAA DADENRPERP GVNRLRLSQL LEVDDVAFEQ
IRAVRDKIGY TAIDLANDSQ PDDPPRLDPQ AEVAIARCVF KRSDRYGRGV RIDEVDMARN
LAAIASAARD EALVIAYLHH HHWASDWYQL PEWVSGVAKR CIDAGASMFV SHGAPVLQPV
EIYRGRPIFY SLGNFIFHVR SEKSTWTAAE VWESVVGVCS FASDNSLIDI SFHPVVIGGD
DGLEDGVLER RLVPQLVTGD SAVRILGRLQ EQSARLGAHI EISGNVGRLQ AR