Gene Smed_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4026 
Symbol 
ID5318326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp484435 
End bp486075 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content63% 
IMG OID640775834 
ProductABC transporter related 
Protein accessionYP_001312767 
Protein GI150376171 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.393631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.697509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCA CACATCTTCT CGAGGTCAAG GACCTGACCG TCGATTTCCT GTCGCTTGGC 
GGATCCTTCC GCGCGACAAA CGGCGTCAGC TTCCATGTCG ATGCCGGAGA GACGCTCGTG
ATCCTGGGCG AGTCCGGTTC CGGCAAATCG GTAAGCGCCA GTGCGATCAT GGGACTCATC
GACACGCCTC CCGGCGACAT CTGCGCGGGG TCGGTCGCCT ATCGCGGCCG CGATCTCTCC
TCGCTTTCGG AAGGTGAGCG GCGGGACCTC AACGGCCGCA AGATCGCGAT GATCTTCCAG
GACCCGCTTT CGCATCTGAA CCCGGTCTAT ACGATCGGCT GGCAGATGGA GGAGGTGTTC
AGCGTCCACG GAGTGGCGAG CGGTGGCGTG GCGCGGCAGA GGGCGATCGA TATATTGAGG
CGCGTCGGCA TCCCGGAGCC GGAGAAGCGC ATCGACCAGT ATCCGCACCA ATTCTCCGGC
GGCCAGCGCC AGCGCATCAT GATCGGCATG GCAATCGCGC TCAGGCCGGA AATCCTGATC
GCCGACGAGC CGACGACAGC GCTCGACGTG AGCGTGCAGG CCCAGATTCT CGAGCTTCTG
AAGAAGCTGC AGGCGGAGGA CGGCCTTGCC ATCATCATGA TCACCCACGA CCTCGAGGTG
GCCGCGAACA TGGCCGACAG GGTGATCGTC ATGAAATCCG GCCGCATCGT GGAAGAGGGC
GAGGCCCGTG CGGTCTTCGA AAACCCCGCC CATACCTATA CGCGGACGCT GATCAACGCG
CTACCGCATG GCGATCACGA AGCCCCGCCG AAACGCGGAA GGCCCGCTGG AAAGCCGATC
CTGGAAGTAA GGAACATCGA CAAGTTCTAC ACTCTCTCCT CCGGGTTTTT CGCGAAACCG
GCGCGGTTGC ACGCGGTCAA GAAGTTGAGC TTCGACGTTG CGGCAGGGGA GACCATCGGC
ATAGTTGGCG AGAGCGGCTC CGGCAAGTCC ACCGTCGCGC GCGTGCTTCT GGGTCTCAAC
GAAGCGTCCG GCGGAGAGGC GCTGTTCCAC GGGCGCGATA TCCTGAAGAT GGACCGCAAG
GAATTGCTGG CCTTCCGCCG GAAGGTGCAG ATGGTTTTTC AGGATCCCTA CAGCTCGATG
AACCCGCGCA TGACGGTGCT CGATATCGTT TCGGAGCCGT GGCGCATCCA CAAGGATATT
CTGGAGAAGC CCCGTTGGCG CGACCGCGTT ACTGAATTGC TGGGGCTCGT CGGCCTGAAC
CCCGAGCATG CCGCACGCTA TCCTCATCAG TTCTCAGGGG GGCAGCGGCA GCGCATCGCC
ATTGCCCGGG CGCTTGCCTG CGATCCGGAG CTCGTCGTGT GCGACGAGGC CGTGTCGGCG
CTCGACGTCT CGGTGCAGGT GCAGGTGATC GGCCTCTTGG CGGAATTGCG CGACCGGCTG
GGCCTCGCCT ACATTTTCAT AACCCATGAC CTGCCGATCG TGCGTCACTT CGCCGACCGG
ATCATCGTCA TGAAGAGCGG CGAGATCGTC GAGCATGCGA CGACGGAAGA GATCTTCCGC
AGTCCGCAGC ATGCCTATAC GCGCCAGCTC ATCAATGCGA CGCCGAAGCC GAAGTGGCAG
ACAGCCGCCG ACGCGGCATG A
 
Protein sequence
MTGTHLLEVK DLTVDFLSLG GSFRATNGVS FHVDAGETLV ILGESGSGKS VSASAIMGLI 
DTPPGDICAG SVAYRGRDLS SLSEGERRDL NGRKIAMIFQ DPLSHLNPVY TIGWQMEEVF
SVHGVASGGV ARQRAIDILR RVGIPEPEKR IDQYPHQFSG GQRQRIMIGM AIALRPEILI
ADEPTTALDV SVQAQILELL KKLQAEDGLA IIMITHDLEV AANMADRVIV MKSGRIVEEG
EARAVFENPA HTYTRTLINA LPHGDHEAPP KRGRPAGKPI LEVRNIDKFY TLSSGFFAKP
ARLHAVKKLS FDVAAGETIG IVGESGSGKS TVARVLLGLN EASGGEALFH GRDILKMDRK
ELLAFRRKVQ MVFQDPYSSM NPRMTVLDIV SEPWRIHKDI LEKPRWRDRV TELLGLVGLN
PEHAARYPHQ FSGGQRQRIA IARALACDPE LVVCDEAVSA LDVSVQVQVI GLLAELRDRL
GLAYIFITHD LPIVRHFADR IIVMKSGEIV EHATTEEIFR SPQHAYTRQL INATPKPKWQ
TAADAA