Gene Smed_0227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0227 
Symbol 
ID5321059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp251880 
End bp252860 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content62% 
IMG OID640789162 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001325921 
Protein GI150395454 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.154077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0935895 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCCA AGCTCCTCAA GAACCGTGAA ATCCTGCTCG TCGTTGCGAT TGCGGTGCTG 
CTCGCTATCA TCGCCTTGCG CTTCCCGGCT TTCGTGGCGC CGTCCAATCT CGCGCGCGTC
TATAACGACA CCTCGATCCT CGTCATCCTG GCGCTTGGAC AGATGGCGGT TATTCTTACC
CGCTGCATCG ACCTGTCGAT GGCGGCCAAC CTCGCGCTCT GCGGCATGGT GGCAGCCATG
CTGAACAATT TCTTCCCCGG CCTGCCGATC CCACTCATCA TTTTTGCCGC CATGGCGCTG
GGCGGGTTCC TCGGCGCGAT CAACGGTACG CTGGTCTGGA AGCTCAACAT TCCGCCGATC
GTCGTGACCC TAGGGACTTT GACGATCTAC CGCGGTCTCA TCTTCGTTTT GACGAACGGC
AAATGGATCA ATGCGCATGA GATGAGCGAC CCCTTCAAGG CGCTGCCGCG GCTGGTCGTC
GCCGGCATGC CGGTGCTTTC CTGGCTCTCC TTCCTCATGA TAGCGCTGAT GTTCCTGGTC
ATCGGACGTA CGCCGCTCGG CCGCGCCTTC TATGCCGTCG GGGGCAATCC GCATGCGGCC
GTCTACACCG GCATCGATGT CGGCCGGACG CGCTTCTTCG CCTATTGTCT CTCGGGTACG
CTTGCGGGCC TGTCAGGTTA TCTCTGGGTA TCGCGTTATG CCGTCGCCTA TGTGGACATC
GCCGCCGGAT TCGAGCTCGA CATCATCGCG GCCTGCGTCA TCGGCGGCAT TTCGATTGCC
GGCGGCATCG GCTCCGTGGC TGGTGCGGTG CTCGGAGCAC TCTTCCTCGG CGTGATCAAG
AACGCGCTGC CGGTCATCGA TATCTCGCCC TTCGCGCAGT TGGCGATATC CGGAACGGTC
ATCATCATCG CGGTTGCCGT CAATGCCCGC GCCGAGCGGC GCAAGGGCAG GGTCATTCTC
AAGAAAGCGG AGGCGGTCTG A
 
Protein sequence
MMAKLLKNRE ILLVVAIAVL LAIIALRFPA FVAPSNLARV YNDTSILVIL ALGQMAVILT 
RCIDLSMAAN LALCGMVAAM LNNFFPGLPI PLIIFAAMAL GGFLGAINGT LVWKLNIPPI
VVTLGTLTIY RGLIFVLTNG KWINAHEMSD PFKALPRLVV AGMPVLSWLS FLMIALMFLV
IGRTPLGRAF YAVGGNPHAA VYTGIDVGRT RFFAYCLSGT LAGLSGYLWV SRYAVAYVDI
AAGFELDIIA ACVIGGISIA GGIGSVAGAV LGALFLGVIK NALPVIDISP FAQLAISGTV
IIIAVAVNAR AERRKGRVIL KKAEAV