Gene Smed_5622 details

Gene Information

Locus tag: Smed_5622
Symbol:
ID: 5319924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 589152
End bp: 590147
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 62%
IMG OID: 640777365
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_001314297
Protein GI: 150377702
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
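
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 589152
END_BP = 590147
GENE_LENGTH_BP = 996
PROTEIN_LENGTH_AA = 331


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_consistent, protein_consistent)
```

Under these assumptions, 590147 - 589152 + 1 = 996 bp and 996 / 3 - 1 = 331 aa, so both checks agree with the listed values.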


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0882363
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.711576
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAATG AACCATTGTT GTCCGTCAAG AATCTGACGG TTGATCTGCT GACCGCGAAA 
TCCGCGCTGC GCCCGGTTGA CGGTGTCAGT TACGCGATCC GGCAGGGTCA GTGTCTCGCC
ATAGTCGGCG AAAGTGGTAG CGGCAAGACC GTGATGAACT TCGCCCCGCT CGGCCTGATG
CCAACGGGCG TGGCGACGAA TCTCTCCGGC TCGGTACGGT TCGAGGGGCA GGAGCTGATC
GGCCTCTCCG AGCCGGAAGT CCGCAAGTTG CGCGGCAAGT CCATCGGGTT CATCTTCCAG
GATCCGATGA GCGCGCTTAA CCCGGTCAGA CGTATCGGGC GGCAGATCGC CGAAATGGCG
GAGCTGCATC TCAACATGAG CCCTCGCGCA GCAGAGGCGA GGGCACTCGA CCTCGTCAAA
CTTGTCGGAA TCTCAGATCC TGCCGCTCGG CTGAGTCAAT ATCCGCACGA ATTGTCGGGC
GGCCTAAGAC AGAGGATCGT CATTGCTATT GCACTTGCTG GAGAGCCGAA GCTGCTCATC
GCCGACGAGC CGACGACTGC CCTCGATGTC ACGGTGCAGG CACAGATTCT CCGCTTGCTG
AAGGACCTGC AGCAGCGCCT GAACATGGCG ATGGTCTTGA TCACCCACGA CATGGGCGTC
GTCGCCGGTG CGGCCGACAA CATCGTCGTC ATGTATGCGG CACGGGCTGC CGAATGCGGG
CCGGTGGACA AGGTCCTCGT CAATCCTCGT CATCCCTACA CAAGGGGCCT GATCAACGCG
ATTCCGAGGC GCGACGATCC GATCGGCTCC GAATTCCGGG GGCTGCCCGG TGTGCCCCCA
ACGCTCGGCG CACCCATCCA GGGCTGCGCT TTCGCGCCGC GCTGCGAGTT CGCAGTGGCC
GCATGCACTC GGGCGCGCCC ACCAATGGTC CCAACCGCCG ACAGCAATGT TTGCGTAGCC
TGCCCTGTCG TCAACCAAGG AAAGGCTGCG GCATGA
 
Protein sequence
MQNEPLLSVK NLTVDLLTAK SALRPVDGVS YAIRQGQCLA IVGESGSGKT VMNFAPLGLM 
PTGVATNLSG SVRFEGQELI GLSEPEVRKL RGKSIGFIFQ DPMSALNPVR RIGRQIAEMA
ELHLNMSPRA AEARALDLVK LVGISDPAAR LSQYPHELSG GLRQRIVIAI ALAGEPKLLI
ADEPTTALDV TVQAQILRLL KDLQQRLNMA MVLITHDMGV VAGAADNIVV MYAARAAECG
PVDKVLVNPR HPYTRGLINA IPRRDDPIGS EFRGLPGVPP TLGAPIQGCA FAPRCEFAVA
ACTRARPPMV PTADSNVCVA CPVVNQGKAA A
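
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the tool that generated the record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts). The names `CODON_TABLE`, `clean`, and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop codon.

    Assumes the sequence starts with ATG; alternative start codons, which
    table 11 reads as methionine at the first position, are not handled.
    """
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
print(translate("ATGCAAAATG AACCATTGTT GTCCGTCAAG"))  # MQNEPLLSVK
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two blocks in this record agree.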