Gene Smed_5042 details


Gene Information

Locus tag: Smed_5042
Symbol:
ID: 5319091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1563228
End bp: 1564472
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 62%
IMG OID: 640776823
Product: peptidase T
Protein accession: YP_001313755
Protein GI: 150377159
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01882] peptidase T
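
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is an untranslated stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1563228
end_bp = 1564472
listed_gene_length = 1245     # bp
listed_protein_length = 414   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1245 0 414
```

Under these assumptions the computed values agree with the listed gene length (1245 bp) and protein length (414 aa).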


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.918686
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAACGC GCGACGAGCT GGTCAGCCGG TTTTTCCGGT ACGTGGCCAT TGAGAGCCAG 
AGTAACGGTC ACTCTGCGTC CCTGCCCTCC TCCCCCGGCC AGTTCGAGCT TGCCTCCTTG
CTGGCCGAGG AGTTGCGGAT GCTCGGGATC GAGGACGTCG TGCTCGATCA GCAGGCGATT
GTGACCGGCG TGAAGCGTGG CACCAGGCCC AATGCGCCAA GGATCGGGTT CATCGCGCAT
CTCGATACGG TCGATGTCGG CCTTTCTGCC ATTATCCGGC CGCAAATTCT CCGGTTCGAA
GGCACGGACC TTTGCCTCAA CCCTCAGGAG GACATCTGGC TGCGCGTTGC CGAACACCCG
GAACTCCTCG CCTGGTCGGG GGAAGACATC ATCGTCAGCG ACGGCACCAG CGTGCTCGGA
GCGGACAACA AGGCGGCGAT CGCGGTCATC ATGACACTTC TCGCCCGGCT CGATGCGCAA
GGCGCCCACG GCGACGTCTT CGTCGCCTTC GTGCCGGACG AGGAAATCGG TCTGCGCGGC
GCCAAGGCGC TCGATCTGGC GCGCTTTGCA TGTGACTTCG CCTACACGAT CGATTGTTGC
GAGCTCGGCG AAGTCGTGCT CGAGACCTTC AACGCCGCAT CGGCTGAAAT CATCTTTACC
GGCGTCAGCG CGCACCCGAT GGCCGCAAAG GGCACCCTCG TGAACCCCCT TTTGATGGCG
CTGGACTTCG TCTCGCACTT TGATCGCAAG GATACACCTG AATGCACGCA GGATCGGCAA
GGGTTCTTCT GGTTCAAAGA GCTTGTTGCG CATGACAGCA AGGCAACACT CAACGTGCTG
GTTCGCGACT TCGATGCGGC AGAATTCGAA CGGCGCAAGC AGCAGCTCCT TGCCATAACG
GCGCTGGTCA ACGCGCACTA TCCCTCCGGC CGCGTCGAGT GCCGGTTGAC CGACACCTAC
CACAATATCG GCCGCCGCCT GCGCGACGAC AGCCGCCCGG GAACGCTGTT GTTCCAGGCT
TTCGACGCGC TCGGGATTGA ACAAAAGCGC ATTCCGATGC GCGGCGGCAC CGATGGCGCC
GTCCTCTCGG CCCGGGGAAT ACCGACGCCG AACTTCTTCA CCGGCGCCTA CAACTTCCAT
TCCCGATTTG AATTCCTGCC GGTCTCAGCT TTCGAAAAGT CGTTCGAGGT TGCAGCCATG
CTTTGCAAAC TGGCGGCGCA GGATGAGGCG TTGGCCGACC GCTAA
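
The gene sequence is printed in blocks of 10 bases, so it needs to be cleaned before it can be measured. The sketch below shows one way to strip the block spacing and compute a GC fraction for comparison with the GC content listed in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste the full sequence to check
# the whole gene.
first_line = "ATGCGAACGC GCGACGAGCT GGTCAGCCGG TTTTTCCGGT ACGTGGCCAT TGAGAGCCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Running the same two helpers on all lines of the gene sequence should give a length equal to the listed gene length if the sequence was copied completely.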
 
Protein sequence
MRTRDELVSR FFRYVAIESQ SNGHSASLPS SPGQFELASL LAEELRMLGI EDVVLDQQAI 
VTGVKRGTRP NAPRIGFIAH LDTVDVGLSA IIRPQILRFE GTDLCLNPQE DIWLRVAEHP
ELLAWSGEDI IVSDGTSVLG ADNKAAIAVI MTLLARLDAQ GAHGDVFVAF VPDEEIGLRG
AKALDLARFA CDFAYTIDCC ELGEVVLETF NAASAEIIFT GVSAHPMAAK GTLVNPLLMA
LDFVSHFDRK DTPECTQDRQ GFFWFKELVA HDSKATLNVL VRDFDAAEFE RRKQQLLAIT
ALVNAHYPSG RVECRLTDTY HNIGRRLRDD SRPGTLLFQA FDALGIEQKR IPMRGGTDGA
VLSARGIPTP NFFTGAYNFH SRFEFLPVSA FEKSFEVAAM LCKLAAQDEA LADR
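
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, alternative start codons are not handled. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon.

    Whitespace is ignored. Bases other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, in the record's block layout.
print(translate("ATGCGAACGC GCGACGAGCT GGTCAGCCGG"))  # MRTRDELVSR
```

The result matches the first block of the protein sequence. The last three codons of the gene (GAC CGC TAA) translate to D, R, and stop, which is consistent with the protein ending in "DR".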