Gene Smed_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4043 
Symbol 
ID5318610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp505436 
End bp507154 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content62% 
IMG OID640775851 
Productphospholipase D/transphosphatidylase 
Protein accessionYP_001312784 
Protein GI150376188 
COG category[I] Lipid transport and metabolism 
COG ID[COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTGC TGCACCGCGA GAGCCCTGGG AAGCTTTATT CGGCAGCTAT GCTGTTGCTG 
GAACAAATCA CGGTTGAAAG GGATTCATTT CCAATCGGAA TGCACGGGAT AGGAAGCTTG
GACTGGAAAA ACGCGCTCAC CGGAAATACG GCCGGCCGAT TGCGACGGGC AAGCCGGCCG
ATTATCAAGG AGGCCGAAAA TGTCTGGCGC AGCGCGCCCG CACGACATCT CTCGTTCCTC
GTGGATGCAG CAGCCTATTA TGCCTGTCTC GACACCATGT TCGAGGAAGC GGAAGAGCAG
CTGTGGATCA CCGGCTGGGA CTTCGATCCG CGCATCAAGC TCAGGCCGGA AGACCCGCAT
GCGGAATCCC TTGGAAGCAC GCTTGAGCGG CTGGCTGCGC AGAAGCCCGA CCTGAAAATC
CGCATTCTCA TTTGGGCGAT GGGCCCCATT TATTCGGGAA AGTCGCTCAG GCTTTTCCGC
AAGCAGCAAT GGGCAGCGCA TCCGCAGATC GAGCTCCGCT TCGCGAGCCA TCGCGCGCTG
CGAGGGTCGC ATCATCAGAA GCTCGTCTGC ATCGATGACA GGATCGCCTT CGCGGGGGGC
ATCGACCTGA CGGCGCGCCG CTGGGACACG CCGGAACACG CGGCAGAGAA CGAGTTGCGT
CGAGATCCGG ACGGCAAGCC TTATGACCCG GTGCATGACA TCCAGGCGAT TGTCGAAGGC
GAGGCGAGCC GCGCGATCGG TGATCTCTGC CGCGCCCGCT GGACAGCCTC CACCGGCGAA
GAAGTCGAAG CTCCGCGTGC GAAGGCATCA AAAGGCGCGC GCACATGGCC ATGGCCCAAC
GGCACCGTGC CGATCCTCGA AAATTGCCCG GTCGCAATCG CCCGGACCGA GCCCGGCTCC
GGCAAAAAGC GCGCCCGCCG GGAAGCATTG CGGCTGACGC TTGACGCCTT GCGCAGCGCG
CGTCGCCACA TCTATATCGA AAACCAGTAT TTCGCGTCCG GAAGGATAGG GCAGCTGCTC
TGCGACCGGC TGCAGGAGCC GGACGGCCCG GAAGTGGTGA TCATCACGAC CCGAAGCTCG
CATGGGCTGC TGGAACGCAT CGTCATGGGC GGCAACCGCG ACCGTCTCAT TCGACGGCTC
ACACAGGCTG ACCGTTACGG CAGGCTCAGG GTTGCCTATC CGGCCGTTCC CGCCCCCGAC
GGATCCGAGC AGGAGGTGAT GATCCATTCC AAGGTGGTCG CGATCGACGA CCGCTTTTTC
CGGGTCGGTT CGTCGAACTT CAACAACCGC TCGGAAAGCC TCGACACCGA ATGCGATGTT
GCCGTGGAAG CCGCCAATGA AGGACACCGC GCGGCAATTG CCAAAATACG CAATGGCCTG
ATCGCAGAAC ATCTCGACGT CCATGCGGAC GCCTTCGCAG AGGCCCTGAG GGAAACGAGC
TCCCTCATAG CCGCCATAGA CAGGTTGAAC ACGCGCCCGC GCGGCATACG CAGCTTTGAC
GGAATCGACA ATGGCGGCGC GACCGATCTG GTCTGGGGAA CGGAGATCAT CGATCCGCAG
CGGCCGATCC GGCCCTTTTA TCGCACGCAC AAGCTGCTCA GGCGCTGGGT CGGTCAGCTT
TTCGCCTTGC TCGCGAGGCT CTTATCGTCG TCGCGACGGG CAGCGAGCTC CGCAACGGAC
AGCGATATCA AGCCCAGCGG CAGCGGCAGG AAGAAATAG
 
Protein sequence
MAVLHRESPG KLYSAAMLLL EQITVERDSF PIGMHGIGSL DWKNALTGNT AGRLRRASRP 
IIKEAENVWR SAPARHLSFL VDAAAYYACL DTMFEEAEEQ LWITGWDFDP RIKLRPEDPH
AESLGSTLER LAAQKPDLKI RILIWAMGPI YSGKSLRLFR KQQWAAHPQI ELRFASHRAL
RGSHHQKLVC IDDRIAFAGG IDLTARRWDT PEHAAENELR RDPDGKPYDP VHDIQAIVEG
EASRAIGDLC RARWTASTGE EVEAPRAKAS KGARTWPWPN GTVPILENCP VAIARTEPGS
GKKRARREAL RLTLDALRSA RRHIYIENQY FASGRIGQLL CDRLQEPDGP EVVIITTRSS
HGLLERIVMG GNRDRLIRRL TQADRYGRLR VAYPAVPAPD GSEQEVMIHS KVVAIDDRFF
RVGSSNFNNR SESLDTECDV AVEAANEGHR AAIAKIRNGL IAEHLDVHAD AFAEALRETS
SLIAAIDRLN TRPRGIRSFD GIDNGGATDL VWGTEIIDPQ RPIRPFYRTH KLLRRWVGQL
FALLARLLSS SRRAASSATD SDIKPSGSGR KK