Gene Smed_5411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5411 
Symbol 
ID5319713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp373928 
End bp375172 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content62% 
IMG OID640777177 
Productpeptidase T 
Protein accessionYP_001314109 
Protein GI150377514 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01882] peptidase T 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.683579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.180292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACGC GCGACGAGCT GGTCAGCCGG TTTTTCCGGT ACGTGGCCAT TGAGAGCCAG 
AGTAACGGTC ACTCTGCGTC CCTGCCCTCC TCCCCCGGCC AGTTCGAGCT TGCCTCCTTG
CTGGCCGAAG AGTTGCGGAT GCTCGGGGTC GAGGACGTCG TGCTCGACGA GCAGGCGATT
GTGACCGGCG TGAAGCGCGG CACCAGGCCC AATGCGCCAA GGATCGGGTT CATCGCGCAT
CTCGATACGG TTGATGTCGG TCTCTCTGCC ATTATCCGGC CGCAAATTCT CCGGTTCGAA
GGCACGGACC TTTGCCTCAA CCCTCAGGAG GACATCTGGC TGCGCGTCGC CGAACACCCG
GAACTCCTCG CCTGGCCGGG GGAAGACATC ATCGTCAGCG ACGGCACCAG CGTGCTCGGC
GCGGACAACA AGGCGGCGAT CGCGGTCATC ATGACACTTC TCGCCCGGCT CGATGCGCAA
GGCGCCCATG GCGACGTCTT CGTCGCCTTC GTGCCGGACG AGGAAATCGG TCTGCGCGGC
GCCAAGGCGC TCGATCTGGC GCGCTTTGCA TGTGACTTCG CCTACACGAT CGATTGTTGC
GAGCTCGGCG AAGTCGTGCT CGAGACCTTC AACGCCGCAT CGGCTGAAAT CGTCTTTACC
GGCGTCAGCG CACACCCGAT GGCCGCAAAG GGCACCCTCG TGAACCCGCT TTTGATGGCG
CTGGACTTCG TCTCGCACTT TGATCGCAAG GATACACCTG AATGCACGCA GGATCGGCAA
GGCTTCTTCT GGTTCAAAGA GCTTGTTGCG CATGACAGCA AGGCAACACT CAACGTGCTC
ATTCGCGACT TCGATGCGGC AGAATTCGAA CGGCGCAAGC AGCAGCTCCT TGCCATAACG
GCGCTGGTCA ACGCGCACTA TCCCTCCGGC CGCGTCGAGT GCCGGTTGAC CGACACCTAC
CACAATATCG GCCGCCGCCT GCGCGACGAC AGCCGCCCGG GAACGCTGTT GTTCCAGGCT
TTCGACGCAC TCGGGATTGA ACGAAAGCGC ATTCCGATGC GCGGCGGCAC CGATGGCGCC
GTCCTCTCGG CACGGGGAAT ACCGACGCCA AACTTCTTCA CCGGCGCCTA CAACTTCCAT
TCCCGATTTG AATTCCTGCC GGTCTCAGCT TTCGAAAAGT CGTTCGAGGT TGCAGGCATG
CTTTGCAAAC TGGCGGCCCA GGACGAGGCG TTGGCCGACC GCTAA
 
Protein sequence
MRTRDELVSR FFRYVAIESQ SNGHSASLPS SPGQFELASL LAEELRMLGV EDVVLDEQAI 
VTGVKRGTRP NAPRIGFIAH LDTVDVGLSA IIRPQILRFE GTDLCLNPQE DIWLRVAEHP
ELLAWPGEDI IVSDGTSVLG ADNKAAIAVI MTLLARLDAQ GAHGDVFVAF VPDEEIGLRG
AKALDLARFA CDFAYTIDCC ELGEVVLETF NAASAEIVFT GVSAHPMAAK GTLVNPLLMA
LDFVSHFDRK DTPECTQDRQ GFFWFKELVA HDSKATLNVL IRDFDAAEFE RRKQQLLAIT
ALVNAHYPSG RVECRLTDTY HNIGRRLRDD SRPGTLLFQA FDALGIERKR IPMRGGTDGA
VLSARGIPTP NFFTGAYNFH SRFEFLPVSA FEKSFEVAGM LCKLAAQDEA LADR