Gene Smed_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0044 
Symbollnt 
ID5320871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp45522 
End bp47153 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content61% 
IMG OID640788975 
Productapolipoprotein N-acyltransferase 
Protein accessionYP_001325739 
Protein GI150395272 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0815] Apolipoprotein N-acyltransferase 
TIGRFAM ID[TIGR00546] apolipoprotein N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00386393 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000481156 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCCTTGCC CACGACCGGC GCCGACTGGA GGTTCAATGG AAAGACTGGC AGGCAGAATC 
ATACTTCTCT CCGGGGTGTC GCGGGCATTC GTCGGTTTTC TCGCCGGCCT CCTGGCGGTG
CTTGCCCAGC CGCCATTTGG CATTTTCGCG GCGGCTTTCG TCTCTTTTCC AGTCCTCGTC
TGGCTGATCG ACGGGGTGGC GCCCGATCCC TCCGACGGCG CATTCCGGCG GCTGAGGCAG
CCCGCCGCAA TCGGCTGGTC CTTCGGCTTC GGCTATTTTC TGGGCGGTCT CTGGTGGCTG
GGCAATGCGC TCCTGGTCGA AGCGGACGCG TTCGCCTGGG CGATACCCCT TGCCGTCGTC
GGCCTTCCAG CCGTTCTCGG GGTTTTTTAC GCGCTGGCGG TCGTCATTGC CCGCTGTCTT
TGGTCCGACG GCTGGGGCCG GATCGCTGCC CTTGCGCTCG GCTTCGGCAT CGCCGAATGG
CTCCGCGGTT TTGTTTTTAC CGGCTTTCCG TGGAATGCCA TCGGTTATGC GGCCATGCCG
ATGCCGTTGA TGATGCAGTC GGCAAGCGTC GTCAATCTCT CAACGATCAA CATGCTGGCC
GTCTTTGTGT TCGCCGCTCC TGCTTTGATC TGGACGGGCA AGGGCGCGCG CACCGGCCTT
GCCATCGCGG TAGCGCTCTT TACGGCGCAT ATCGCATTCG GCTTCTACCG GCTCGCTCAG
CCGGCGCCGC CGTCGGCTGC ACCGCAAATG GCGGTACGCG TCGTACAGCC GGTCATCGAC
CAGGCCAAAA AGCTCGACGA CCGCGAGCGC GCCTCGATCT TCGAGGACCA CCTCTCATTG
ACGGCCGCCC CGGTTCAAGG TGGCGGCAAG CGTCCGGACA TCGTCGTTTG GCCGGAAACG
TCGATCCCTT TCATCCTCAC CGACAATCCC GACGCGCTGG CGCGGATCGC GGAGGTTCTC
AAGGATGGGC AGATACTCGT CGCCGGCGCC GTCAGGGCCG AGGATGCAGG CGCCGGGCTG
CCGTCGCGCT ACTATAACTC TGTCTATGTT ATTGACGACC GGGGCCAGAT CATTGGCGCG
GCGGACAAGG TGCATCTGGT GCCGTTCGGT GAATATCTCC CCTACGAGGA CCTGCTGACG
TCCTGGGGCT TGAGTTCCAT CGCGGCTTCG ATGCCGGGCG GCTTCTCGGC AGCCAGGATG
CGCCCTGTGC TCACTTTGCC GGGCGGCAGA AGACTTTACC CGATGATCTG TTACGAGGCG
ATCTTTGCCG ATGAGGTGGA CGCCAATGCG CGCCTCGCCG ACGTGCTCCT CAATGTCACC
AACGATGCCT GGTTCGGTGA CACGCCAGGT CCGCGCCAGC ATTTCCATCA GGCGCAGCTC
CGCGCGGTTG AAACCGGAAT TCCCATGATC CGCGCTGCGA ATACTGGTAT TTCAGCAGTT
GTTGATGCAC GTGGTGTTTT AGTGTTAGTA TTAGGCTACA ATTACAGGGG TGTTTTAGAC
ACAATTCTGC CGGGAAAACT GCCTACGCTA ACGGACGTCC CGACGCGCAG CCGGATTTTT
TGGTTGTCGA TGGCTATTCT ATCTATAGTT GCATCATTCT CGCGTTTTGG TTTCAATATT
AGGAAGAATT GA
 
Protein sequence
MPCPRPAPTG GSMERLAGRI ILLSGVSRAF VGFLAGLLAV LAQPPFGIFA AAFVSFPVLV 
WLIDGVAPDP SDGAFRRLRQ PAAIGWSFGF GYFLGGLWWL GNALLVEADA FAWAIPLAVV
GLPAVLGVFY ALAVVIARCL WSDGWGRIAA LALGFGIAEW LRGFVFTGFP WNAIGYAAMP
MPLMMQSASV VNLSTINMLA VFVFAAPALI WTGKGARTGL AIAVALFTAH IAFGFYRLAQ
PAPPSAAPQM AVRVVQPVID QAKKLDDRER ASIFEDHLSL TAAPVQGGGK RPDIVVWPET
SIPFILTDNP DALARIAEVL KDGQILVAGA VRAEDAGAGL PSRYYNSVYV IDDRGQIIGA
ADKVHLVPFG EYLPYEDLLT SWGLSSIAAS MPGGFSAARM RPVLTLPGGR RLYPMICYEA
IFADEVDANA RLADVLLNVT NDAWFGDTPG PRQHFHQAQL RAVETGIPMI RAANTGISAV
VDARGVLVLV LGYNYRGVLD TILPGKLPTL TDVPTRSRIF WLSMAILSIV ASFSRFGFNI
RKN