Gene Smed_3604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3604 
Symbol 
ID5318438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp32198 
End bp33739 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content62% 
IMG OID640775418 
ProductABC transporter related 
Protein accessionYP_001312351 
Protein GI150375755 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.88039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCG AGACCGTGCT CGACATCCGC AATGTCAGCA AGCACTTCGG CGCCGTGAAG 
GCCCTGACAG CCGTAAACTT CCGGCTCGAG CGGGGCGAGG TTCACGCGCT CTGCGGCGAG
AACGGCGCCG GCAAGTCGAC ATTGATGAAC GTCATCGCCG GCGTGCTGCA GCCGTCGGAA
GGAGAAATCC TCGTCGAGGG AGCGCCGGTG AGGATCGCTT CGCCCGCGGT CGCGCAATCG
CTCGGTATCG GGCTCGTCCA TCAGGAGATA GCGCTCTGCC CGGACGCGAC GATCGCGGAA
AACATGTTCA TGGCGGCGAC CAACCGCCGG CGCTCGGTTC TGATGAATTA CCGCAAACTG
GAGCGGGACG CCCAAATCCT GATGAACCGT CTGGCGCCGA TCGATGTCCG TCAGAAAGTC
GGCGACCTAT CGATCTCCAG CCAGCAACTC GTCGAGATCG CCAAGGCGCT GACGCTTGAC
TGCCGCGTGC TCATTTTCGA CGAGCCGACG GCGGCGCTGA CCGAGACCGA AGCTCAGGTC
CTCTTCGGCA TCATCCGCGA TCTCAAGGCG CGCGGCATCT CGATCATCTA TATCAGCCAC
CGCATGGCGG AAGTGTTCAG CCTTTGCGAC CGGGTGACGG TTTTCCGCGA CGGACGCTAT
GTTGCGACCG AAACCGTTGC GGACATCACC CCTGACGACG TTGTCCGGCT CATGGTCGGC
CGCGAGATCG ATCAGCTTTA TCCGGAGAAG CACGGCCAGT CGGGGTGCGT CGGCGAGCCC
ATATTGTCGG TCAGGAAGCT CGGTGACGGC GCGCGTTTCC GCGACGTCAG CTTCGAGCTG
CGCCACGGCG AAATCCTTGG CGTTGGTGGC CTGATCGGTT CCGGGCGGAC GGAAATCGCC
GAGGGCATAT GCGCGCTGCG CGCGGCGACG CAAGGCGAAA TCCGGCTTCA TGACAAGGTA
CTTCGACTGC GCCGCTATTC CGATGCAGCG AAGGCCGGCG TCGTCTATCT TTCAGAGGAT
CGGAAAGGCT CCGGCATATT CCTCGAAATG TCGATCGCTC AGAACATTGC CGCGCTCGAT
CTGAAAGCGT TGACCTCATT CGGCCTGCTC AATTTCCGCA AGGAAAGATC GCTCGCCGAA
GAATTGGCCC GCAGACTTGG CGTGCGGATG GGCGGTGTCG ATATGCCGGT CTCTTCGCTT
TCGGGCGGCA ATCAGCAGAA GGTGTCGATA GCCAAGCAGC TGGCCGTCAA CCCGAAGGTG
ATCCTGTTGG ACGAACCGAC GCGTGGGATC GATGTCGGCG CCAAATCGGA AATTCACCGA
TTGCTGCGCG ATCTTGCCCG CGCGGGTATC GGCATCCTTG TCATCTCCTC CGAACTGCCG
GAGCTCATCG GCCTCTGCGA CCGGGTCCTG GTAGTGCGCG AAGGCGGTAT AGCCGGCGAG
GTCTCGGGCG ATGAAATGAC CGAGGAGGCG ATCATGCGGC TTGCATCCGG CATCGGTCCG
GAGGCCAACA CGAATTCAAA GGCATCCGGG CATGCGGCCT GA
 
Protein sequence
MTSETVLDIR NVSKHFGAVK ALTAVNFRLE RGEVHALCGE NGAGKSTLMN VIAGVLQPSE 
GEILVEGAPV RIASPAVAQS LGIGLVHQEI ALCPDATIAE NMFMAATNRR RSVLMNYRKL
ERDAQILMNR LAPIDVRQKV GDLSISSQQL VEIAKALTLD CRVLIFDEPT AALTETEAQV
LFGIIRDLKA RGISIIYISH RMAEVFSLCD RVTVFRDGRY VATETVADIT PDDVVRLMVG
REIDQLYPEK HGQSGCVGEP ILSVRKLGDG ARFRDVSFEL RHGEILGVGG LIGSGRTEIA
EGICALRAAT QGEIRLHDKV LRLRRYSDAA KAGVVYLSED RKGSGIFLEM SIAQNIAALD
LKALTSFGLL NFRKERSLAE ELARRLGVRM GGVDMPVSSL SGGNQQKVSI AKQLAVNPKV
ILLDEPTRGI DVGAKSEIHR LLRDLARAGI GILVISSELP ELIGLCDRVL VVREGGIAGE
VSGDEMTEEA IMRLASGIGP EANTNSKASG HAA