Gene Smed_5038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5038 
Symbol 
ID5319087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1558248 
End bp1559225 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content62% 
IMG OID640776819 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001313751 
Protein GI150377155 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.642958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.861569 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTGA TTTCTGTTGT CCTCGGCAGG CTGTCGCAAG CCTTCCTGCT CCTCATCGCC 
ATGTCGCTGA TCGGCTTCAT TGGCGTCCAC AGCGTCGGCA ATCCGGTCTT CAATGTCGTC
AACATCGAAA CCGCCACGCC GGAGGATATC CGCCAGGCCA CGATCGCGCT CGGCCTCGAT
CAGCCGATCT GGCGCCAGTA CCTGCTTTTC ATCAACAATG TCGTGCGCGG AAACTTCGGC
ACCTCATATA TCTATCATCT GCCGGCCTTC GCGCTGGTCA TGAGCAAGCT GCCGGCGACC
CTGGAGCTCG CATGCGTCGC CATGCTTATC GCGGCGCCGG TTGGAACCGG CCTCGGGCTG
CTCGCCGGCC GGCGGAGCGG CACGATCTTC GACCGGACGG TCCTCAGATC GAGCGTCTTC
GCGCTGAGTA TCCCGTCATT CTGGCTGAGC ATGATGCTCA TCCTGCTCGG CGCCATCCTG
ACAGGCTGGT TTCCATCCGG CGGCCGCGGC ACGACGGCGC GGTTCCTTGG CCAGGAGTGG
AGTTTTCTGA CCGGGAACGG GCTCTGGCAC ATGGTCCTTC CTGCCCTGGC GCTGGCGATA
CCGAACGTCG CCTTGATCGC GCGGCTTTCG CGATCGGGCA CGATCGAAGT CGAGAACCTG
GACTTCACGC GGTTTTGTCG CGCGAAAGGG CTGTCCTCCC GAAGGATCCT GCTTCGCCAC
ACGCTTCCAA ACATCAGCGT GCCGATCGTG ACAATCATCG GCCTGCAATT TGGCGGCATG
CTCGCTTTTG CCGTGGTCGT GGAAACGATC TTCTCGTGGC CGGGCGTCGG CAAACTTCTG
ATCGACTCCA TCCAGCTCCT CGACCGGCCA GTGGTGATGG CGACGCTGAC CTTCATTGCC
GTCGCCTTCG TCGCGCTGAA TGCCCTCGTC GACTTGTTCT ATGCCGTGCT CGACCCGCGC
GTGCGCCTGT CTTCATGA
 
Protein sequence
MPLISVVLGR LSQAFLLLIA MSLIGFIGVH SVGNPVFNVV NIETATPEDI RQATIALGLD 
QPIWRQYLLF INNVVRGNFG TSYIYHLPAF ALVMSKLPAT LELACVAMLI AAPVGTGLGL
LAGRRSGTIF DRTVLRSSVF ALSIPSFWLS MMLILLGAIL TGWFPSGGRG TTARFLGQEW
SFLTGNGLWH MVLPALALAI PNVALIARLS RSGTIEVENL DFTRFCRAKG LSSRRILLRH
TLPNISVPIV TIIGLQFGGM LAFAVVVETI FSWPGVGKLL IDSIQLLDRP VVMATLTFIA
VAFVALNALV DLFYAVLDPR VRLSS