Gene Smed_3504 details

Gene Information

Locus tag: Smed_3504
Symbol:
ID: 5324392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3709638
End bp: 3710894
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 64%
IMG OID: 640792456
Product: peptidase M29 aminopeptidase II
Protein accession: YP_001329157
Protein GI: 150398690
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
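
The length fields above follow from the coordinates. The short Python sketch below shows the arithmetic. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record is built, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    return length_bp // 3 - 1


# Coordinates copied from the record above.
length_bp = gene_length(3709638, 3710894)
print(length_bp, protein_length(length_bp))  # prints "1257 418"
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields.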


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCC CCCTCCAATC CGTGAAAAAC CCTGTCGATC CGGTCAAGCT CGAAAAGCTC 
GCCGAGGTCG CCATCAAGAT CGGCCTGCAA TTGCAGAAGG GCCAGGATCT GGTGATGACC
GCGCCGATCG CGGCGCTGCC GCTCGTCCGG CTCATCACCA AGCATGCCTA TAAGGCGGGG
GCCGGACTGG TGACGACCTT CTACGCCGAC GAGGAAACGA CTCTCGCCCG CTATGCCCAT
GCGCCGGACG AGAGTTTCGA CCGCGCCAGC GACTGGCTTT ACGAGGGGAT GGCCAGAGCC
TATGCCGGTG GCGCTGCGCG CCTTGCCATC GCCGGCGACA ATCCGATGCT GCTTTCGGCT
CAGGATCCGA CGAAGGTCGC GCGCGCCAAC AAGGCGAACT CGATCGCCTA CAAGCCGGCG
CTGGAGAAGA TATCAAACTT CGACATCAAC TGGAACATCG TCTCCTATCC GAACCCCTCC
TGGGCGAAGC AGATGTTCCC GGACGATCCC GAAGCGGTTG CGGTGGAAAA GCTGGCAAAC
GCCATCTTCG CCGCATCGCG GGTCGACGTC GACGATCCGA TCGCCGCCTG GAAGGAGCAC
AACGCCAATC TGCACCAGCG CTCGAACTGG CTCAACGAGG AACGTTTCGC CGCCCTCCAC
TTCACGGGAC CCGGCACCAA CCTGACCATC GGATTGGCGG ACGGCCACGA GTGGCATGGC
GGCGCCTCGG TCGCCAAGAA CGGCATCACC TGCAATCCGA ACATCCCGAC CGAAGAGGTC
TTCACCACGC CGCACGCGCT GCGCGTCGAA GGCCATGTGT CGAGCACCAA GCCGCTCTCC
CATCAGGGCA CGCTGATCGA CAATATTCAG GTGCGTTTCG AAGGCGGCCG TATCGTCGAG
GCCAAGGCCG CGCGCGGCGA AGAGGTTCTG AACAAGGTGC TCGATACGGA CGAGGGCGCC
CGCCGCCTCG GCGAGGTGGC GCTTGTGCCG CATTCCTCGC CGATCTCGGC AAGCGGCATC
CTCTTCTACA ACACCCTCTT CGACGAGAAT GCCTCCTGCC ACATCGCGCT TGGCCAGTGC
TATTCGAAGT GTTTCCTCGA CGGGGCGAGC CTCAGCCAGG AGCAGATCCG CGCCCAGGGC
GGCAATGCGA GCCTGATCCA CATCGACTGG ATGATCGGTT CCGGTGAGGT CGACATCGAC
GGCGTACGCG CCGACGGCGG CCGTGTGCCC GTCATGCGCA AGGGCGAGTG GGCCTGA
 
Protein sequence
MNAPLQSVKN PVDPVKLEKL AEVAIKIGLQ LQKGQDLVMT APIAALPLVR LITKHAYKAG 
AGLVTTFYAD EETTLARYAH APDESFDRAS DWLYEGMARA YAGGAARLAI AGDNPMLLSA
QDPTKVARAN KANSIAYKPA LEKISNFDIN WNIVSYPNPS WAKQMFPDDP EAVAVEKLAN
AIFAASRVDV DDPIAAWKEH NANLHQRSNW LNEERFAALH FTGPGTNLTI GLADGHEWHG
GASVAKNGIT CNPNIPTEEV FTTPHALRVE GHVSSTKPLS HQGTLIDNIQ VRFEGGRIVE
AKAARGEEVL NKVLDTDEGA RRLGEVALVP HSSPISASGI LFYNTLFDEN ASCHIALGQC
YSKCFLDGAS LSQEQIRAQG GNASLIHIDW MIGSGEVDID GVRADGGRVP VMRKGEWA
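
The GC content and the protein sequence can be recomputed from the gene sequence. The Python sketch below is one way to do that, not the method used to build this record. It assumes the sequence is pasted as shown, in whitespace-separated blocks of 10 bases. It uses the standard codon assignments, which translation table 11 shares with the standard genetic code, and it does not model the alternative start codons of table 11. The names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAACGCCC CCCTCCAATC CGTGAAAAAC"))  # prints "MNAPLQSVKN"
```

Passing the full gene sequence to `gc_content` and `translate` gives values to compare with the GC content field and the protein sequence listed above.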