Gene Smed_5198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5198 
Symbol 
ID5319500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp156791 
End bp158116 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content60% 
IMG OID640776976 
Productamino acid permease-associated region 
Protein accessionYP_001313908 
Protein GI150377313 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.043341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCAC CGTCAAGTGG CAAGAGCCTG GGTCTGGCCG CCTGTACCGC CATCGTCGTC 
GGTAACATGG TTGGCTCCGG CTTCTACCTC TCGCCGGCGG CAGTTGCTCC CTACGGCAAT
CTCGCTATCG TCATCTGGAT CGTGATGGGA GCGGGTGCGA TCTGTCTCGG GCTGACGTTC
GCACGACTCG CCAAGCTCTC CCCGGCGGTC GGAGGGCCTT ATGCATATAC GCGCATAGCC
TATGGAGACT TCCCGGGATT TCTTATCGCT TGGGGATATT GGATTTCCAT CTGGGCGTCG
CTGCCCGTTA TCGCAGTGGC GTTCGCCGGC GTGGTCATCG ATTTTTTTCC GATCCTCCGC
GGGCGCGGAA CGGCGACGCT GCTCACGTTG AGCGTGATCT GGCTTGTCGT GCTCGTCAAC
TTACGCGGCG TCCACGCGGC AGGGCTCTTC TCCGAAATCA CCACCTACGC TAAGATGATC
CCGTTCGGGG CCGTCGCGCT GCTGGGCCTG TTTTACATCG ACTTCTCCCA CTTCGCCGAC
TTCAATCCGA GCGGCCAGCC GCTCCTTCAG GCGAGCGCTG CGTTGGCGCC GCTCACCATG
TTCGCCTATC TGGGGCTTGA ATCTGCCACG GTGCCCGCTG GCGATGTGCG CGACGCCGAA
CGTACGATCC CGCGTTCAAC GGTGCTTGGA ATCTCCATTG CTGTAACGCT GTACGTTCTC
GGCACCATTG TCGTTATGGG GTTGGTGCCG AGAGAGGAGC TCGTCCACTC GGTGGCACCT
TTCTCCGAGG CAGCAAGGAG AATGTGGGGA CCGGCCGGTG AACTAGCGAT TTCCCTGGCA
GTTGTCCTGT CGTCGATCGG AGCGCTGAAC GGCTGGACCT TGCTGATGGG TCAGGTGCCA
ATGGCGGCGG CGCGAGACGG ATTGTTTCCA CCGCTGTTCA GCCGGCTCTC GGCGCGAAGT
GTGCCCGCCA CGGGGATTGT CGTTTCGGCG ACTCTGGCGA CAATTCTCGT GCTCGTTCAG
GCAGCCGGTT CCGAGGGCTT CTCATCCATT TATCGGCTAT TCGTCGGCTT GAGCACAATG
ACGGCCGTTA TACCTTATGC GTTCTGCGCT CTTGCCAGCA GTCTGGTCTC AGCACGGGTT
AGCGGAGGGA CTGTAATACC GCGTGTAACC CTTATCGAGC TTGTTGGTTT CGCTTTTGCA
ATGTTCACGC TTTACGGCTG TGGTGCGGAG CCTGTCCTCT ATGGACTGAT GCTGCTGTTG
CTGAGCATCC CCGTTTACAT ATGGCAGCGA CGTCGGAGCT TCGTGCCGGG TGATTTCGGC
CAATGA
 
Protein sequence
MSAPSSGKSL GLAACTAIVV GNMVGSGFYL SPAAVAPYGN LAIVIWIVMG AGAICLGLTF 
ARLAKLSPAV GGPYAYTRIA YGDFPGFLIA WGYWISIWAS LPVIAVAFAG VVIDFFPILR
GRGTATLLTL SVIWLVVLVN LRGVHAAGLF SEITTYAKMI PFGAVALLGL FYIDFSHFAD
FNPSGQPLLQ ASAALAPLTM FAYLGLESAT VPAGDVRDAE RTIPRSTVLG ISIAVTLYVL
GTIVVMGLVP REELVHSVAP FSEAARRMWG PAGELAISLA VVLSSIGALN GWTLLMGQVP
MAAARDGLFP PLFSRLSARS VPATGIVVSA TLATILVLVQ AAGSEGFSSI YRLFVGLSTM
TAVIPYAFCA LASSLVSARV SGGTVIPRVT LIELVGFAFA MFTLYGCGAE PVLYGLMLLL
LSIPVYIWQR RRSFVPGDFG Q