Gene Smed_3801 details


Gene Information

Locus tag: Smed_3801
Symbol:
ID: 5318099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 250914
End bp: 251918
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 63%
IMG OID: 640775614
Product: von Willebrand factor type A
Protein accession: YP_001312547
Protein GI: 150375951
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
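
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(250914, 251918)
print(length, protein_length_aa(length))  # 1005 334
```

Both values match the Gene Length and Protein Length fields of the record.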


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.142641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.822741
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGTCC TTGATCATCC CTGGCTCCTG CTGCTTCTGC CGGCACCCCT GTTCGTCTGG 
TGGCTGCTGC CGCCCTATCG GGAACAGACG CCGGCAGTGC GCATTCCCTT TTTCGAGGAC
ATCACCCGGG CCGCTGGCAT CGGCCCGACG GAAGGCTCGG TCGTGCCGCG TGCCAACCTC
CTGCAGAAGA TCATCGCGCC GATCTGCTGG CTCCTGGTGT TGACGGCGCT TGCCCGGCCG
CAATTCGTAG AGCCGCCGAT CGAGAAGACC GAGCCCCAGC GCGATCTGAT GCTCGCGCTC
GACCTCTCGC AATCGATGGA CACGCGCGAC TTCAGCGACC CGCAGGGCAA TCTTCAGGCG
CGGGTAGATG CGGTGAAGAC CGTGGTGGCA GACTTTGTCG ATCGCCGTCC GTATGATCGC
CTCGGTCTCG TCGCTTTCGG TGACGCCCCC TATCCGCTCG TTCCCTTCAC CATGGATCAT
GCCACCGTCC GGTCCATGCT GACCGGCGCC TTACCGGGCA TGGCCGGCCC AAAAACGGCT
CTCGGCGATG CGCTGGGGCT TTCGATAAAA CTGTTCCAGC AGAGCCAGGC TCCTGACAAG
GTGCTGGTCG TTCTGACCGA CGGCAACGAC ACCGCCAGCA AGATGCCGCC GGACAAGGCC
GCCGAGATTG CGAGCCAGAA CCACATCCGT ATTCATACGG TCGGCATCGG CAATCCCGAC
GCCCAGGGAG AGGAAAAGCT CGATACCGAG ACGCTGCAAA AGATCGCCAC GGCTACCGGA
GGACGCTATT TCTTCGGTCA GGACCAGCAA GCGCTCGCCG AGATATACAC GCTGCTCGAC
AGCATCACAC CGGCGAACCA GAAAACGCTG AGCTGGCGCC CGCGCATCGA GCTGTTCCAC
TACCCACTCG GCGCTGCCGT CCTCCTCGTA CTCGGCTATC ATGCCCTAAT GTGGCTTCTC
TCGGTTAGTG CCGCCCGCAG GCGAAACAAC GAGGCTGAAG CATGA
 
Protein sequence
MYVLDHPWLL LLLPAPLFVW WLLPPYREQT PAVRIPFFED ITRAAGIGPT EGSVVPRANL 
LQKIIAPICW LLVLTALARP QFVEPPIEKT EPQRDLMLAL DLSQSMDTRD FSDPQGNLQA
RVDAVKTVVA DFVDRRPYDR LGLVAFGDAP YPLVPFTMDH ATVRSMLTGA LPGMAGPKTA
LGDALGLSIK LFQQSQAPDK VLVVLTDGND TASKMPPDKA AEIASQNHIR IHTVGIGNPD
AQGEEKLDTE TLQKIATATG GRYFFGQDQQ ALAEIYTLLD SITPANQKTL SWRPRIELFH
YPLGAAVLLV LGYHALMWLL SVSAARRRNN EAEA
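
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch. It assumes the amino acid assignments of translation table 11 (the same codon-to-residue mapping as the standard code, laid out in TCAG order), strips the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The helper names `clean`, `translate` and `gc_percent` are hypothetical, and the way `gc_percent` counts bases is an assumption that may differ from how the database derived its GC content figure.

```python
from itertools import product

# Codon table in TCAG order; residues follow the standard assignments,
# which translation table 11 shares for sense and stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGTACGTCC TTGATCATCC CTGGCTCCTG"))  # MYVLDHPWLL
```

Passing the full gene sequence to `translate` should give the protein sequence listed above without its block spacing, and `gc_percent` on the same input can be compared against the GC content field.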