Gene Smed_4390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4390 
Symbol 
ID5318360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp884516 
End bp886264 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content64% 
IMG OID640776194 
Producttype I secretion system ATPase 
Protein accessionYP_001313127 
Protein GI150376531 
COG category[R] General function prediction only 
COG ID[COG4618] ABC-type protease/lipase transport system, ATPase and permease components 
TIGRFAM ID[TIGR01842] type I secretion system ABC transporter, PrtD family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.500904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACAT CCAACGGCAG GAATGCCGAC CCAGCCTCTG CCTTGCGGGA TTGCCGAACG 
GCTTTCATCG GTGTTGCCAT CGCAAGCGCG CTCGTCAACG TTCTATACCT CACCGGGTCG
TTCTTCATGC TCGAAGTCTA CGATCGAATT CTGCCGAGCC GCAGCATTCC GTCCCTGGTT
GCTCTCTCTC TTCTCGCGCT GCTGCTTTAT GCCTTTCAGG GAGCCTTTGA GCTCGTTCGC
GGGCGGATGC TGGTGCGCAT CGCCGGTGCC CTCGACGAGA GCCTGAACGG TCGCATTTAC
CGCGCCATCG TGAAGGCGCC GCTGAAGCTC AGAATGCAGG GGGACGGTCT CCAGGCGCTG
CGCGATTTCG ATCAGGTTCG GTCGTTCCTG TCGGGTGCCG GCCCGGCCGC GCTCTTCGAC
CTGCCCTGGC TGCCCTTCTA CATCGCGATC TGCTTTCTTT TCCACCCGGT CATCGGTTTG
GTCGCGATCA TCGGCGGCCT GGTCCTGATG TTGCTCACCT ATCTCACCAA CCGCGGCACC
CAGGCGCCTG CCAGGAAAGC CTCGGAGGCC GGAGGGCTTC GCAACGCCTT CGCGCAGGCC
TCCCAGCGCA ATGCCGAGGT GGTTCATGCC ATGGGAATGT CAGCGCGGCT GACGGCGATG
TGGGAGAGGC GCAATACGGA GTTCCGCGAT GAGAACCGCC GCACCTCCGA TATCGGCAAT
GGCTACGGCG CGTTGTCGAA GGTCTTTCGC ATGGCGCTGC AGTCCTGCGT TCTGGCGGCC
GGCGCCGTTC TGGTGATACG AGGCGAAGCT TCGCCCGGGA TCATCATTGC GGGCTCGATC
CTGACAGCCC GGGCTCTTGC GCCCGTGGAA CTTGCCATCG GCAACTGGCG CGGCCTCGTC
GCAGCGCGCC AGAGCTGGCA ACGCCTCAAG GAATTGCTCA AAGCCCTGCC GGAAGCCGAT
GCACCACTCC AGCTTCCGAC CCCGCGCGAT CGCCTCACCG TCGAAGGGCT GGCAAGCGGT
CCGCCGGCGG CGCAGCGCCT CATCTTGTCG GATGTGAATT TCACGGTCGG TGCGGGCGGT
GCCGTGGGAG TCATAGGACC GAGCGCTTCG GGAAAATCGT CTCTGGCGCG CGCGATCCTC
GGTATATGGC CGGCCTATCG CGGCTCGGTT CGGCTGGACG GTGCAGCCCT CGATCAATGG
GACAGCGATG AACTCGGGAA ACACATCGGC TACCTGCCGC AGGACGTGGA ACTGTTCGCC
GGGACGATCG CGCAGAACAT CTGCCGTTTT GCCGAAGACG CGACACCGGA CGCGATCGTC
GCCGCCGCAA AGGCGGCGCG CGTCAACGAT CTGATCCTCC GGCTTCCGAA CGGCTATGAC
ACCGAGATCG GCGATGGCGG CATGACGCTC TCGGCCGGCC AGCGCCAACG GGTGGCTCTC
GCGAGGGCCC TTTACGGCAA TCCCTTTCTC GTCGTTCTCG ACGAGCCCAA TTCCAACCTC
GACGCCGAGG GCGAGCAGGC GCTCAGCGAA GCGATCATGA GCGTGCGCAG CCGTGGCGGC
ATCGTCATCG TGGTCGCCCA CCGGCCGAGC GCACTCGCAA GCGTCGATCT CGTGCTGATG
ATGAATGAAG GACGCATGCA GGCTTTCGGG CCCAAGGAGC AAGTCCTCGG TCAGGTACTT
CGTCCGCAAC AGGTGGAGCG ACAGAATTCG CTGAAAATCG TTGCAGAAGG GCAGGAGGCG
AAGCAATGA
 
Protein sequence
MATSNGRNAD PASALRDCRT AFIGVAIASA LVNVLYLTGS FFMLEVYDRI LPSRSIPSLV 
ALSLLALLLY AFQGAFELVR GRMLVRIAGA LDESLNGRIY RAIVKAPLKL RMQGDGLQAL
RDFDQVRSFL SGAGPAALFD LPWLPFYIAI CFLFHPVIGL VAIIGGLVLM LLTYLTNRGT
QAPARKASEA GGLRNAFAQA SQRNAEVVHA MGMSARLTAM WERRNTEFRD ENRRTSDIGN
GYGALSKVFR MALQSCVLAA GAVLVIRGEA SPGIIIAGSI LTARALAPVE LAIGNWRGLV
AARQSWQRLK ELLKALPEAD APLQLPTPRD RLTVEGLASG PPAAQRLILS DVNFTVGAGG
AVGVIGPSAS GKSSLARAIL GIWPAYRGSV RLDGAALDQW DSDELGKHIG YLPQDVELFA
GTIAQNICRF AEDATPDAIV AAAKAARVND LILRLPNGYD TEIGDGGMTL SAGQRQRVAL
ARALYGNPFL VVLDEPNSNL DAEGEQALSE AIMSVRSRGG IVIVVAHRPS ALASVDLVLM
MNEGRMQAFG PKEQVLGQVL RPQQVERQNS LKIVAEGQEA KQ