Gene Smed_2738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2738 
Symbol 
ID5323608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2850614 
End bp2851873 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content62% 
IMG OID640791683 
Productvon Willebrand factor type A 
Protein accessionYP_001328403 
Protein GI150397936 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4961] Flp pilus assembly protein TadG 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGAC GTAACAATGC AGGCCTGTCC TTCATGAGGA TGCTGCGGGA TCGAGGCGGG 
AATTTCGGGA TGATGACGGC GCTGGTCGCG CCGCTACTCC TTGCCGTCGG CGGGGTATCC
GTCGACGTCG CCAACATGCT GATGACCAAG AACCAGCTTC AGGACGCAAC CGACGCGGCA
GCGCTCGCGG CCGCCTCCGC TCTCGTATCC GACGCGAGGC CAGACATCGA AGAGGCAAAG
GACCTCGCGC GCAAGTTCCT GAAGACGCAG GCGGCAGCGG CGACCGCGTC GGACCTCCCG
GACGAGGGGC CGTCGATAGG GGCGCGCGGC GGCGGGAATG CGGATGACGA AGTACCTGCG
ACGCCCCGGT GGGAGGATGT GAATGCTACG GAAATCGACA TCACCGCGAC GCCGAACGGT
GCAAAGGGGA AGTCTTTCCA GGTTACCGTC GCCAACAAGC ACCTGCTCCA GTTCAATGCC
ATGACGCGTC TGCTCGGCCC GGAGTCGATC GAGATCGAAA CCCGATCCAC CGCCGAGAGC
GCGACGGAGA GCAAGAACGC CCTGTCCATG TATCTGGTGC TCGACCGGTC CGGGTCGATG
GCGTGGAAAA CCAACACGAT AAACACAGGC AAGGCGAAAT GCCCCAACTA CACGGAGGCG
AACTGGAGCA AGTATCCGGA CCTCAAGGCT ACCGGCCCCT GCTATGTAAC GAAGATTGAT
GCCCTGAAGA CAGCGGTTGG CGACCTCCTC GCCCAGCTTG TCACGGCGGA CCCGGAATCG
GCCTATGTCC GCACCGGTGC GATCTCCTAC AATTCCGCCC AGGACGCGGC GAGCAGTCTT
TCCTGGGGAA CGAGAGGTGC AGCCGGTTAT GTCGACGCCC TGGTCGCCAT AGGCGGGACC
GCCTCCGGCA ACGCCTTCAA GACCGCGTTC CAGAAGGTCA CCAACGCTGC GGAAGACAGC
GAGCACGGTG CAAAGAACGG TCAGGTGCCG ACGAAGTACA TCGTGTTCAT GACCGATGGC
GAAAACAACC ATGCCAATGA CGACACCGTC ACCAGGCAGT GGTGCGACAC AGCCAAAGCA
AGCAAGGTCC AGATCTACAG CGTTGCATTC ATGGCGCCGG ATCGCGGCCA GAAGCTGCTG
AAGTCCTGTG CTTCGTCTTC CTCCCACTAT TTCGAAGCGG AGGAGGCGTC CGATCTCGTC
GCCGCCTTCA AGGCGATCGG CGAACGCGCG GCCGCGTCGG TATCCCGCTT GACGAAATGA
 
Protein sequence
MGRRNNAGLS FMRMLRDRGG NFGMMTALVA PLLLAVGGVS VDVANMLMTK NQLQDATDAA 
ALAAASALVS DARPDIEEAK DLARKFLKTQ AAAATASDLP DEGPSIGARG GGNADDEVPA
TPRWEDVNAT EIDITATPNG AKGKSFQVTV ANKHLLQFNA MTRLLGPESI EIETRSTAES
ATESKNALSM YLVLDRSGSM AWKTNTINTG KAKCPNYTEA NWSKYPDLKA TGPCYVTKID
ALKTAVGDLL AQLVTADPES AYVRTGAISY NSAQDAASSL SWGTRGAAGY VDALVAIGGT
ASGNAFKTAF QKVTNAAEDS EHGAKNGQVP TKYIVFMTDG ENNHANDDTV TRQWCDTAKA
SKVQIYSVAF MAPDRGQKLL KSCASSSSHY FEAEEASDLV AAFKAIGERA AASVSRLTK